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Methods and Reagents For Modulating TGF-p Superfamilv Signaling 

Background of t he Invention 
5 TGF-P superfamily members signal through activation of 

transmembrane serine-threonine kinase receptors. These receptors 
phosphorylate and activate Smads, a novel class of signal transducers. Signals 
initiated by TGF-P superfamily members are important for regulating cellular 
processes, including cell division, survival, differentiation, and specification of 
10 developmental fate throughout the growth and development of diverse 
organisms. 

During early cmbryogenesis of the frog Xenopus laevis, the TGF-P 
growth factor family plays a central role in the specification and patterning of 
various tissues: TGF-P superfamily members activin, Vg-1, and TGF-P all 

1 5 induce a full range of dorsal and ventral mesodermal markers in early 

embryonic tissue, whereas other TGF-P superfamily members specify axial 
pattern or epidermal, as opposed to neural, tissue. Almost all the critical 
patterning events in early Xenopus embryogenesis appear to involve members 
of the TGF-P superfamily. 

20 The transforming growth factor P (TGF-P) superfamily of cytokines, 

which includes bone morphogenic proteins (BMPs), activin, TGF-P, and Vg-1, 
regulate a wide range of normal and pathological biological processes. These 
processes include cell specification during development, terminal 
differentiation of many cell types, fibrosis during wound healing or organ 

25 damage (e.g., cirrhosis), proliferation and invasiveness of normal and 

transformed cells, and angiogenesis and immune suppression induced by 
tumors (Roberts and Sporn, Peptide growth factors and their receptors I , eds. 
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Sporn and Roberts, Berlin, Springer- Verlage, 419-473, 1990; Sporn et al., 
Science 33: 532*534, 1986). For example, one member of the family, TGF-P, 
is secreted by a wide variety of tumors and has a wide variety of 
immunosuppressive effects, including the ability to induce apoptosis in B and T 
5 lymphocytes (Brabletz et al, Mol. Cell BioL 13: 1 155-1 162, 1993; Cahouchi et 
al, Oncogene 11: 1615-1622, 1995; Weller et al, Exp. Cell Res. 221:395-403, 
1995). The ability to manipulate specific aspects of TGF-P superfamily 
signaling in vivo would be a powerful tool both for understanding the role of 
these factors in normal embryonic patterning and for controlling a broad range 
1 0 of pathological processes. 

Summary of the Invention 
We have discovered methods and reagents for identifying 
compounds that modulate TGF-p superfamily signaling. These methods and 
compounds are useful for the detection and treatment of conditions involving 
15 abnormal TGF-P superfamily signaling. 

In the first four aspects, the invention provides methods for detecting 
compounds capable of modulating TGF-P superfamily signaling. The methods 
include the steps of providing a cell having a reporter gene opcrably linked to a 
DNA-binding-protein recognition site, in addition to having either: 
20 a) a first fusion gene capable of expressing a first fusion protein 

comprising a polypeptide fragment of Smad2 covalently bonded to a binding 
moiety capable of specifically binding to the DNA-binding-protein recognition 
site and a second fusion gene capable of expressing a second fusion protein 
comprising a polypeptide fragment of FAST-1 covalently bonded to a gene 
25 activating moiety, 

b) a first fusion gene capable of expressing a first fusion protein 
comprising a polypeptide fragment of FAST-1 covalently bonded to a binding 



WO 98/53830 



PCI7US98/10983 



-3- 

moiety capable of specifically binding to the DNA-binding-protein recognition 
site and a second fusion gene capable of expressing a second fusion protein 
comprising a polypeptide fragment of Smad2 covalently bonded to a gene 
activating moiety, 

5 c) a first fusion gene capable of expressing a first fusion protein 

comprising a polypeptide fragment of Smad3 covalently bonded to a binding 
moiety capable of specifically binding to the DNA-binding-protein recognition 
site and a second fusion gene capable of expressing a second fusion protein 
comprising a polypeptide fragment of FAST- 1 covalently bonded to a gene 

10 activating moiety, or 

d) a first fusion gene capable of expressing a first fusion protein 
comprising a polypeptide fragment of FAST-1 covalently bonded to a binding 
moiety capable of specifically binding to the DNA-binding-protein recognition 
site and a second fusion gene capable of expressing a second fusion protein 

1 5 comprising a polypeptide fragment of Smad3 covalently bonded to a gene 

activating moiety; exposing the cell to the compound; and measuring reporter 
gene expression in the cell, where a change in the reporter gene expression 
indicates that the compound is capable of modulating TGF-[i superfamily 
signaling. 

20 In the fifth, sixth, seventh, and eighth aspects, the invention features 

a cell useful for detecting a compound capable of modulating TGF-(3 
superfamily signaling, the cell having a reporter gene operably linked to a 
DNA-binding-protein recognition site in addition to having either: 

a) a first fusion gene capable of expressing a first fusion protein 

25 comprising a polypeptide fragment of Smad2 covalently bonded to a binding 
moiety capable of specifically binding to the DNA-binding-protein recognition 
site and a second fusion gene capable of expressing a second fusion protein, the 
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second fusion protein comprising a polypeptide fragment of FAST- 1 covalently 
bonded to a gene activating moiety, 

b) a first fusion gene capable of expressing a first fusion protein 
comprising a polypeptide fragment of FAST- 1 covalently bonded to a binding 

5 moiety capable of specifically binding to the DNA-binding-protein recognition 
site and a second fusion gene capable of expressing a second fusion protein, the 
second fusion protein comprising a polypeptide fragment of Smad2 covalently 
bonded to a gene activating moiety, 

c) a first fusion gene capable of expressing a first fusion protein 
10 comprising a polypeptide fragment of Smad3 covalently bonded to a binding 

moiety capable of specifically binding to the DNA-binding-protein recognition 
site and a second fusion gene capable of expressing a second fusion protein, the 
second fusion protein comprising a polypeptide fragment of FAST- 1 covalently 
bonded to a gene activating moiety, or 

15 d) a first fusion gene capable of expressing a first fusion protein 

comprising a polypeptide fragment of FAST- 1 covalently bonded to a binding 
moiety capable of specifically binding to the DNA-binding-protein recognition 
site and a second fusion gene capable of expressing a second fusion protein, the 
second fusion protein comprising a polypeptide fragment of Smad3 covalently 

20 bonded to a gene activating moiety. 

In preferred embodiments of the first eight aspects of the invention, a 
decrease in reporter gene expression indicates a compound that is capable of 
inhibiting TGF-P superfamily signaling, and an increase in reporter gene 
expression indicates a compound that is capable of enhancing TGF-p 

25 superfamily signaling. In other embodiments of these aspects of the invention, 
reporter gene expression may be assayed by a color reaction or assayed by cell 
viability. In still another embodiment of the first eight aspects of the invention, 



WO 98/53830 



PCT/US98/10983 



-5- 

the cell is a yeast cell. 

In the ninth, tenth, eleventh, and twelfth aspects, the invention 
provides a method for detecting a compound capable of modulating TGF-P 
superfamily signaling. The method comprises the steps of providing a first 
5 polypeptide comprising a polypeptide fragment of FAST- 1 , providing a second 
polypeptide, the second polypeptide comprising a polypeptide fragment of 
either Smad2 or Smad3 (or alternatively, providing a first polypeptide 
comprising a polypeptide fragment of Smad2 or Smad3, and providing a 
second polypeptide comprising a polypeptide fragment of FAST-1), exposing 

1 0 the first polypeptide to the second polypeptide and to the compound, and 

measuring the level of interaction between the first polypeptide and the second 
polypeptide, wherein an alteration in the level of interaction indicates that the 
compound is capable of modulating TGF-P superfamily signaling. 

In one preferred embodiment of the ninth, tenth, eleventh, and 

1 5 twelfth aspects of the invention, at least one of the first polypeptide or the 
second polypeptide is immobilized on a solid-phase substance. In another 
preferred embodiment, a decrease in the level of interaction indicates that the 
compound is capable of inhibiting TGF-P superfamily signaling, and an 
increase in the level of interaction indicates that the compound is capable of 

20 enhancing TGF-P superfamily signaling. In other embodiments of the ninth, 
tenth, eleventh, and twelfth aspects, the first polypeptide is produced by a cell 
that contains a first fusion gene capable of expressing the first polypeptide, and 
the second polypeptide is produced by a cell that contains a second gene 
capable of expressing the second polypeptide. 

25 In various preferred embodiments of all of the above aspects of the 

invention, the polypeptide fragment of FAST-1 consists of, at maximum, 
Xenopus FAST-1 amino acids 380 to 506, human FAST-1 amino acids 234 to 
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365, and mouse FAST-1 amino acids 309 to 398. In other preferred 
embodiments of all of the aspects of the invention, the polypeptide fragment of 
Smad2 consists of, at maximum, Smad2 amino acids 248 to 467 or 274 to 467, 
and the polypeptide fragment of Smad3 consists of, at maximum, Smad3 amino 

5 acids 230 to 446, amino acids 253 to 446, amino acids 230 to 424, or amino 
acids 253 to 424. 

In the thirteenth aspect, the invention features a polypeptide 
comprising a polypeptide fragment of FAST-1 . In a preferred embodiment of 
this aspect of the invention, the polypeptide fragment of FAST-1 includes, at 

10 maximum, Xenopus FAST-1 amino acids 380 to 506, or fragments thereof, 

human FAST-1 amino acids 234 to 365, or fragments thereof, or mouse FAST- 
1 amino acids 309 to 398, or fragments thereof. 

In the fourteenth, fifteenth, sixteenth, and seventeenth aspects, the 
invention features a method for detecting a compound capable of modulating 

15 TGF-P superfamily signaling, comprising providing a reporter gene operably 
linked to a DNA-binding-protein recognition site and additionally providing 
either: 

a) a first fusion protein comprising a polypeptide fragment of FAST- 
1 covalently bonded to a binding moiety capable of specifically binding to the 
20 DNA-binding-protein recognition site and a second fusion protein comprising a 
polypeptide fragment of Smad2 covalently bonded to a gene activating moiety, 
h) a first fusion protein comprising a polypeptide fragment of Smad2 
covalently bonded to a binding moiety capable of specifically binding to the 
DNA-binding-protein recognition she and a second fusion protein comprising a 
25 polypeptide fragment of FAST-1 covalently bonded to a gene activating 
moiety, 

c) a first fusion protein comprising a polypeptide fragment of FAST- 
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1 covalently bonded to a binding moiety capable of specifically binding to the 
DN A -binding-protein recognition site and a second fusion protein comprising a 
polypeptide fragment of Smad3 covalently bonded to a gene activating moiety, 
or 

5 d) a first fusion protein comprising a polypeptide fragment of Smad3 

covalently bonded to a binding moiety capable of specifically binding to the 
DNA-binding-protein recognition site and a second fusion protein comprising a 
polypeptide fragment of FAST-1 covalently bonded to a gene activating 
moiety; 

10 exposing the first fusion protein to the second fusion protein, to the 

reporter gene, and to the compound; and measuring the reporter gene 
expression, a change in the reporter gene expression indicating a compound 
that is capable of modulating TGF-P superfamily signaling. 

In various preferred embodiments of the fifteenth, sixteenth, 

1 5 seventeenth, and eighteenth aspects, a change in reporter gene expression that 
is a decrease indicates a compound that is capable of inhibiting TGF-P 
superfamily signaling, and a change in the reporter gene expression that is an 
increase in the reporter gene expression indicates a compound that is capable of 
enhancing TGF-P superfamily signaling. In other embodiments, the 

20 polypeptide of FAST-1 includes Xenopus FAST-1 amino acids 380 to 506, or 
fragments thereof, human FAST-1 amino acids 234 to 365, or fragments 
thereof, or mouse FAST-1 amino acids 309 to 398, or fragments thereof; the 
polypeptide fragment of Smad2 includes Smad2 amino acids 248 to 467. or 
fragments thereof; and the polypeptide fragment of Smad3 includes Smad3 

25 amino acids 230 to 424, or fragments thereof. In yet another embodiment, 
providing the first fusion protein comprises providing a first fusion gene 
capable of expressing the first fusion protein and providing the second fusion 



WO 98/53830 



PCT/US98/10983 



-8- 

protein comprises providing a second fusion gene capable of expressing the 
second fusion protein. 

In the nineteenth aspect, the invention provides a method for 
diagnosing a mammal having or likely to develop a disorder involving 
5 abnormal TGF-p superfamily signaling. The method includes determining 
whether the mammal has a mutation in a gene encoding FAST-1 . In a 
preferred embodiment of this aspect, the mutation is in the Smad Interaction 
Domain (SID). 

In the twentieth aspect, the invention provides methods for 
1 0 diagnosing a mammal having or likely to develop a disorder involving 

abnormal TGF-P superfamily signaling comprising determining whether the 
mammal has an altered level of expression of FAST-1. 

In preferred embodiments of the nineteenth and twentieth aspects of 
the invention, the disorder is a developmental disorder, and the mammal is a 
1 5 human, and may be a fetus. 

In the twentieth aspect, the invention features a substantially pure 
mammalian FAST-1 protein or polypeptide fragment thereof for use in 
modulating TGF-P superfamily signaling. 

In preferred embodiments of the twentieth aspect, the protein or 
20 polypeptide fragment may be from a human or a rodent. In other preferred 
embodiments, the polypeptide fragment comprises the Smad Interaction 
Domain (SID). In still another preferred embodiment, the polypeptide 
fragment binds to Smad2 or Smad3. 

In a twenty-first aspect, the invention features a substantially pure 
25 polypeptide fragment comprising the Smad Interaction Domain (SID) of 
FAST-1 from Xenopus, for use in modulating TGF-p superfamily signaling. 

In related, twenty-second, twenty-third, and twenty-fourth aspects, 
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the invention features substantially pure polypeptides or fragments thereof 
having about 50% or greater ammo acid sequence identity, about 75% or 
greater amino acid sequence identity, and about 90% or greater amino acid 
sequence identity to the comparable amino acid sequence of the mammalian 
5 FAST-1 protein or polypeptide fragment thereof. Preferably, the identity is 
determined by comparison with the FAST-1 SID (i.e., FAST-1 amino acids 380 
to 509 of Xenopus FAST-1, amino acids 234 to 365 of human FAST-1, or 
amino acids 309 to 398 of mouse FAST-1). In another preferred embodiment, 
the polypeptide fragment binds to Smad2 or Smad3. 
10 In a twenty-fifth aspect, the invention features a substantially pure 

nucleic acid encoding a mammalian FAST-1 protein or polypeptide fragment 
thereof. 

In a twenty-sixth aspect, the invention features a vector containing a 
nucleic acid of the twenty-fifth aspect, capable of directing expression of the 

1 5 protein or polypeptide fragment thereof. 

In a twenty-seventh aspect, the invention features a substantially pure 
nucleic acid encoding a FAST-1 Smad Interaction Domain (SID). 

In a twenty-eighth aspect, the invention features a cell containing the 
vector of the twenty-sixth and twenty-seventh aspects above. 

20 In a twenty-ninth aspect, the invention features a method of 

modulating TGF-P superfamily signaling in a cell, comprising providing a cell 
intracellular^ with a substantially pure FAST-1 protein, or polypeptide 
fragment thereof, wherein the FAST-1 protein or polypeptide fragment is 
sufficient to modulate TGF-P superfamily signaling in a cell. 

25 In a thirtieth aspect, the invention features a method of modulating 

TGF-P superfamily signaling in a cell, comprising introducing, into a cell, a 
vector comprising a nucleic acid encoding FAST- 1 protein, or polypeptide 
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fragment thereof, wherein the vector is capable of directing expression of the 
protein or polypeptide fragment in a cell containing the vector, and wherein 
expression of the FAST- 1 protein or polypeptide fragment is sufficient to 
modulate TGF-P superfamily signaling in a cell. 

5 In preferred embodiments of the twenty-ninth and thirtieth aspects, 

the signaling may be decreased or increased. 

"Reporter gene" means any gene that encodes a product whose 
expression is detectable. Such genes include, without limitation, lacZ, amino 
acid biosynthetic genes, for example, the yeast LEU2, HIS3, LYS2, TRP1, or 

10 URA3 genes, nucleic acid biosynthetic genes, the mammalian chloramphenicol 
transacetylase (CAT) gene or GUS gene, or any surface antigen for which 
specific antibodies are available. Reporter genes may encode any enzyme that 
provides a phenotypic marker, for example, a protein that is necessary for cell 
growth or a toxic protein leading to cell death, or gene encoding a protein 

15 detectable by color assay or whose expression leads to an absence of color. 

Other preferred reporter genes are those encoding fluorescent markers, such as 
the green fluorescent protein (GFP)-encoding gene, or reporter genes encoding 
enzymes whose activity may be detected by chemiluminescence, such as 
luciferasc. Reporter genes may facilitate either a selection or a screen for 

20 reporter gene expression, and quantitative differences in reporter gene 
expression may be measured as an indication of interaction affinities. 

"Covalently bonded" means that two domains are joined by covalent 
bonds, directly or indirectly. That is, the "covalently bonded" proteins or 
protein moieties may be immediately contiguous or may be separated by 

25 stretches of one or more amino acids within the same fusion protein. 

"Protein" or "polypeptide" or "polypeptide fragment" means any 
chain of more than two amino acids, regardless of post-translational 



WO 98/53830 



PCT/US98/10983 



-11- 

modification (e.g., glycosylation or phosphorylation), constituting all or part of 
a naturally-occurring polypeptide or peptide, or constituting a non-naturally 
occurring polypeptide or peptide. 

"Smad2 protein or polypeptide fragment thereof means a Smad2 

5 protein (or polypeptide fragment or domain thereof) found in Xenopus or 

mammalian (e.g. mouse or human) cells. A preferred domain of Smad2 is the 
Mad Homology 2 (MH2) domain (i.e., amino acids 274 to 467 of human or 
Xenopus Smad2). Also preferred are polypeptide fragments comprising the 
MH2 domain, that consist of, at maximum, amino acids 274 to 467 or amino 

10 acids 248 to 467 of human or Xenopus Smad2, or the corresponding amino 
acids that comprise Smad2 MH2 domains from other species. These 
polypeptide fragments are capable of interacting with the FAST-1 Smad 
Interaction Domain (SID). 

"Smad3 protein or polypeptide fragment thereof means a Smad3 

1 5 protein (or polypeptide fragment or domain thereof) found in Xenopus or 

mammalian (e.g. mouse or human) cells. A preferred domain of Smad3 is the 
Mad Homology 2 (MH2) domain (i.e., amino acids 253 to 446 of human 
Smad3). Also preferred are polypeptide fragments comprising the MH2 
domain, thar consist of, at maximum, human Smad3 amino acids 230 to 446, 

20 and sub fragments thereof, consisting of, at maximum, amino acids 253 to 446, 
amino acids 253 to 424, or amino acids 230 to 424, or the corresponding amino 
acids that comprise Smad3 MH2 domains from other species. These 
polypeptide fragments are capable of interacting with the FAST-1 SID domain. 
"Mammalian FAST-1 protein or polypeptide fragment thereof 

25 means an amino acid sequence derived from a mammalian cell which displays 
at least 30%, preferably, 40%, more preferably 50%, still more preferably 60%, 
70%, or even 80%o means amino acid sequence identity to a FAST-1 Smad 
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Interaction Domain (SID), i.e., amino acids 380 to 506 of the Xenopus FAST-l 
protein, amino acids 234 to 365 of the human FAST-l protein, or amino acids 
309 to 398 of the mouse FAST-l protein. The length of comparison, generally 
will be at least 16 amino acids, preferably at least 20 amino acids, more 
5 preferably at least 25 amino acids, and most preferably at least 30 amino acids. 
Preferably, a mammalian FAST-l protein, or polypeptide fragment thereof, is 
able to bind Smad2. The FAST-l SID is a preferred polypeptide fragment of 
FAST-l. 

"Operably linked'' means that a gene and one or more regulatory 
10 sequences are connected in such a way as to permit gene expression when the 
appropriate molecules (e.g., transcriptional activator proteins) are bound to the 
regulatory sequences. 

"Binding moiety" means a stretch of amino acids which is capable of 
directing specific polypeptide binding to a particular DNA sequence (i.e., a 
15 "protein binding site"). 

"Modulatory compound" or "modulating compound", as used herein, 
means any compound capable of either increasing or decreasing the amount of 
signaling initiated by a TGF-P superfamily member. 

"Substantially pure protein" or substantially pure polypeptide" means 
20 a protein or polypeptide that has been separated from the components that 

naturally accompany it. Typically, the polypeptide is substantially pure when it 
is at least 60%, by weight, free from the proteins and naturally-occurring 
organic molecules with which it is naturally associated. Preferably, the 
polypeptide is a xenopus or mammalian, e.g. human or mouse, FAST-l 
25 polypeptide, or polypeptide fragment thereof, that is at least 75%, more 

preferably at least 90%, and most preferably at least 99%, by weight, pure. A 
substantially pure mammalian, e.g. human or mouse, FAST-l polypeptide, or 
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polypeptide fragment may be obtained, for example, by extraction from a 
natural source (e.g. a fibroblast, neuronal cell, or lymphocyte) by expression of 
a recombinant nucleic acid encoding a FAST-1 polypeptide, or by chemically 
synthesizing the protein. Purity can be measured by any appropriate method, 
5 e.g., by column chromatography, polyacrylamide gel electrophoresis, or HPLC 
analysis. 

A polypeptide is substantially free of naturally associated 
components when it is separated from those contaminants which accompany it 
in its natural state. Thus, a polypeptide which is chemically synthesized or 
1 0 produced in a cellular system different from the cell from which it naturally 
originates will be substantially free from its naturally associated components. 
Accordingly, substantially pure polypeptides include those which naturally 
occur in eukaryotic organisms but are synthesized in E. coli or other 
prokaryotes. 

15 "Substantially pure nucleic acid" means nucleic acid that is free of 

the genes which, in the naturally-occurring genome of the organism from which 
the nucleic acid of the invention is derived, flank the gene. The term therefore 
includes, for example, a recombinant nucleic acid that is incorporated into a 
vector; into an autonomously replicating plasmid or virus; or into the genomic 

20 nucleic acid of a prokaryote or eukaryote; or that exists as a separate molecule 
(e.g., a cDNA or a genomic or cDKA fragment produced by PCR or restriction 
endonuclease digestion) independent of other sequences. It also includes a 
recombinant nucleic acid that is part of a hybrid gene encoding additional 
polypeptide sequence. 

25 By "Substantially identical" means a polypeptide or nucleic acid 

exhibiting at least 75%. preferably 85%, more preferably 90%, and most 
preferably 95% identity to a reference amino acid or nucleic acid sequence. 
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For polypeptides, the length of comparison will generally be at least 16 amino 
acids, preferably at least 20 amino acids, more preferably at least 25 amino 
acids, and most preferably 35 amino acids. For nucleic acids, the length of 
comparison sequences will generally be at least 50 nucleotides, preferably at 

5 least 60 nucleotides, more preferably at least 75 nucleotides, and most 
preferably 1 10 nucleotides. 

Sequence identity is typically measured using sequence analysis 
software with the default parameters specified therein (e.g., Sequence Analysis 
Software Package of the Genetics Computer Group, University of Wisconsin 

10 Biotechnology Center, 1710 University Avenue, Madison, WI 53705). This 

software program matches similar sequences by assigning degrees of homology 
to various substitutions, deletions, and other modifications. Conservative 
substitutions typically include substitutions within the following groups: 
glycine, alanine, valine, isoleucine, leucine; aspartic acid, glutamic acid, 

15 asparagine, glutamine; serine, threonine; lysine, arginine; and phenylalanine, 
tyrosine. 

Default settings of sequence analysis software programs employ 
parameters that are considered, by those of skill in the art, to yield biologically 
significant results; i.e., an alignment of two polypeptides that shows one or 
20 more amino acid stretches having a high percentage of sequence identity 

represents iwo polypeptides that share a functional relationship. For example, 
FAST-1 polypeptides are identified by virtue of their possessing an amino acid 
sequence that displays at least 30% identity to a FAST-1 SID. 



25 



Brief Description of the Drawings 
Fig. 1 shows a schematic diagram indicating the interactions between 
FAST-1, Smad2, and DPC4 (Smad4) in ARF formation. WH and SID indicate 
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the winged helix region and the Smad Interaction Domain of FAST-1, 
respectively. 

Fig. 2 shows a supershifted electrophoretic mobility-shift assay 
(EMSA) demonstrating the incorporation of Smad4 into the ARF complex. 
5 Fig. 3 shows a supershifted EMSA demonstrating the presence of 

Smad2 and Smad4 within the same ARF complex. 

Fig. 4 shows a supershifted EMSA demonstrating that the 
Smad2/MH2 domain alone can be incorporated into the ARF complex. 

Fig. 5A shows a Western blot of whole lysates and anti-GST co- 
1 0 immunoprecipitates from Xenopus embryos co-microinjected with RNA 

encoding GST-FAST- 1 and Myc-Smadl or Myc-Smad2, plus or minus RNA 
encoding activin, demonstrating that Smad2 but not Smadl co-precipitates with 
FAST-1 in an activin-stimulated manner. 

Fig. 5B shows a Western blot of whole lysates and anti-GST co- 
1 5 immunoprecipitates from Xenopus embryos co-microinjected with RNA 
encoding GST-FAST- 1 and Myc-Smad4 (Myc-DPC4), plus or minus RNA 
encoding activin, demonstrating that Smad4 co-precipitates with FAST-1 in an 
activin-dependent manner. 

Fig. 6A shows a summary of experiments that tested the ability of 
20 Myc-tagged FAST-1 deletion mutants to become incorporated into the 

ARF/ARE complexes or to associate with Smad2 in an activin-dependent and - 
independent manner. 

Fig. 6B is a diagram of Myc-tagged FAST-1 showing the Smad 
Interaction Domain (SID) and the winged helix domain (ammo acids 108-219). 
25 Fig. 7 shows an EMSA demonstrating the inhibition of ARF/ARE 

complex formation by overexpression of the FAST-1 SID. 



WO 98/53830 



PCT/TJS98/10983 



-16- 

Fig. 8A shows an agarose gel containing RT-PCR amplification 
products that were generated using primers specific for the pan-mesodermal 
marker brachyury (Xbra), and the ubiquitously expressed marker EFla, 
demonstrating that the FAST-1 SID inhibits activin-induced brachyury 
5 expression. 

Fig. 8B (Panels A-F) shows a series of photographs of animal caps 
from control and experimentally-manipulated Xenopus embryos, demonstrating 
that the FAST-1 SID blocks activin induction of mesodermal cell movements 
in early embryos. 

10 Fig. 9 shows a Western blot analysis, with anti-Myc antibody, of 

whole ly sates and FAST-1 immunoprecipitates from lysates of control embryos 
and embryos expressing GST -tagged FAST-1 plus Myc-tagged Smadl MH2 
domain or Myc-tagged Smad2 MH2 domain, demonstrating that the Smad2 
MH2 domain co-immunoprecipitates FAST-1 in an activin-stimulated manner. 

15 Fig. 10 shows an amino acid sequence alignment of human, mouse, 

and Xenopus FAST- 1 . 

Detailed Description of the Invention 
It is now demonstrated that the interactions of a FAST-1 polypeptide 
fragment with Smad2 and Smad3 polypeptide fragments in vivo as well as in 
20 vitro are clearly involved in TGF-P superfamily signaling pathways in 
eukaryotic cells. 

In Xenopus laevis embryos, Smad2 is a component of the activin 
responsive factor (ARF) complex that binds to the ARE promoter element of 
the Mix. 2 gene. The major DNA binding component of the ARF is a novel 
25 winged helix transcription factor that we have named FAST-1 . In the present 
invention, we show that Smad4 is present in ARF, and that FAST-1, Smad4, 
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and Smad2 co-immunoprecipitate in an activin-regulated fashion. We have 
mapped the site of interaction between FAST-1 and Smad2/Smad4 to a novel 
C-terminal domain of FAST-1; overexpression of this domain specifically 
inhibits activin signaling. 

5 In a yeast 2-hybrid assay, the FAST-1 C-terminus was found to 

directly interact with Srnad2, but not Smad4. Furthermore, we can detect 
binding of the FAST-1 C-terminus to the MH2 domain of Smad2 in vitro. The 
results of these findings have allowed us to propose the model for ARF 
formation shown in Fig. 1. The interaction of FAST-1 and Smad2 domains 

10 provided in the present invention allows the identification of compounds 
capable of modulating the effects of TGF-P superfamily signaling and the 
identification of patients who either have or are likely to develop disorders 
involving abnormal TGF-P superfamily-mediated signal transduction. 

15 I. Uses for the Invention 

The methods and compounds provided in the invention allow 
modulation and simulation of the signaling pathways of TGF-P superfamily 
members. These methods and compounds may provide a means to detect 
treatments and to possibly treat or cure individuals with a variety of diseases, 

20 including, without limitation, developmental disorders, immunological 
disorders, and cancer. The invention also describes methods by which 
individuals may be identified who either have or are likely to develop disorders 
involving abnormal TGF-P superfamily signaling. 

II. FAST-L Smad2. and Smad3 Fragments 

25 We have found that polypeptide fragments comprising various 

portions of the FAST-1, Smad2 and Smad3 proteins have been useful in 
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identifying the domains important for the interaction of FAST- 1 (SEQ ID NO: 
1 1, 14, and 17) with either Smad2 (SEQ ID NO: 2 and 5) or Smad3 (SEQ ID 
NO: 8). Methods for generating such fragments are well known in the art (see, 
for example, Ausubel et al. f Current Protocols in Mole cular Biology. John 

5 Wiley & Sons, New York, NY, 1994) and are further described herein. For 
example, a Smad2 polypeptide fragment may be generated by amplifying the 
desired fragment by the polymerase chain reaction (PCR) using oligonucleotide 
primers designed according to the published Smad2 nucleic acid sequence 
(SEQ ID NO: 1 and 4). Preferably the oligonucleotide primers comprise 

10 unique restriction enzyme sites which facilitate insertion of the DNA fragment 
into the cloning site of a eukaryotic expression vector. Furthermore, the Smad2 
fragment may be tagged with an epitope (e.g., hemagglutinin (HA) or GST) by 
cloning the fragment into a epitope fusion expression vector. The Smad2 
fragment-bearing vector is then introduced into a prokaryotic or eukaryotic cell 

15 by artifice, using the various techniques known in the art and described herein, 
which result in the production of the Smad2 polypeptide fragment. Similar 
techniques using FAST-1 (SEQ ID NO: 10, 13, and 16) and Smad3 (SEQ ID 
NO: 7) nucleic acid sequences are employed to generate FAST-1 and Smad3 
polypeptide fragments. 

20 In one approach, FAST-1 and Smad2, or Smad3 polypeptide 

fragments may be used to evaluate the portions of these proteins involved in 
regulation of TGF-P signaling during Xenopus laevis embryogenesis. In 
particular, polypeptide fragments comprising the domains of the FAST-K 
Smad2, and Smad3 proteins responsible for the interaction of FAST-1 with 

25 either Smad2 or Smad3 may be used to induce TGF-0 superfamily signaling, or 
to prevent TFG-P superfamily signaling. 
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III. Screens for Compounds Which Modulate TGF-P Superfamily Signaling 

FAST-1 and Smad2 or FAST-1 and Smad3 may be used to facilitate 
the identification of compounds that increase or decrease TGF-p superfamily- 
mediated signal transduction. In one approach, compounds that modulate the 

5 signals generated by the TGF-P superfamily are detected by screening for 

compounds that alter the physical interaction between the FAST-1 SID domain 
(SEQ ID NO: 12, 15, and 18) and the Smad2 (SEQ ID NO: 3 and 6) or Smad3 
(SEQ ID NO: 9) MH2 domain. These compounds are detected by adapting 
yeast two-hybrid expression systems known in the art for use as described 

10 herein. These systems which allow detection of protein interactions via a 

transcriptional activation assay, are generally described by Gyuris et al. (Cell 
75:791-803, 1993) and Fields et al (Nature 340:245-246, 1989), and are 
commercially available from Clontech (Palo Alto, CA). 

In this approach, a region of FAST-1, which we have discovered 

15 interacts with Smad2, is fused to the GAL4-DNA-binding domain by 

subcloning a DNA fragment encoding this, the FAST-1 Smad Interaction 
Domain (SID), into the expression vector, pGBT9, provided in the 
MATCHMAKER Two-Hybrid System kit commercially available from 
Clontech (catalog number K1605-1). A fusion of the GAL4 activation domain 

20 with the MH2 domain of Smad2 or Smad3 (which interacts with the FAST- 1 
SID) is generated by subcloning the Smad2 or Smad3 MH2 domain-encoding 
DNA fragment into the expression vector, PGAD424, also provided in the 
Clontech kit. Analogous expression vectors may also be used. Yeast 
transformations and colony lift filter assays are carried out according to the 

25 methods of MATCHMAKER Two-Hybrid System and various methods known 
in the art. Prior to the colony filter assay, transformed yeast may be treated 
with candidate compounds being screened for the ability to modulate TGF-P 
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signaling. The interaction results obtained using the candidate compound in 
combination with the yeast system may then be compared to those results 
observed with the yeast system not treated with the candidate compound, all 
other factors (e.g., cell type and culture conditions) being equal. A compound 
5 capable of modulating TGF-P superfamily-mediated signaling is able to alter 
the interaction between the Smad2 or Smad3 MH2 domain and the FAST-1 
SID. 

In another embodiment of this approach, a compound capable of 
decreasing TGF-P superfamily signaling by disrupting the binding of the 

10 Smad2 (or Smad3) MH2 to the FAST-1 SID may be isolated using the 

modified yeast-two hybrid system described above, in which the reporter gene 
encodes a protein, such as ricin, that is toxic to yeast. Yeast cells containing 
such a ricin reporter gene die unless the binding of Smad2 MH2 to FAST-1 
SID is disrupted. Yeast cells treated with a compound that disrupts the 

15 Smad2/FAST-1 interaction form viable colonies, and from this result it may be 
inferred that the compound is capable of decreasing, and possibly inhibiting, 
signals initiated by members of the TGF-P superfamily. 

In another approach, compounds capable of inhibiting signaling by 
TGF-p and other members of the TGF-P superfamily may be identified in vitro 

20 using assays that detect disruption of the in vitro binding of FAST-1 SID to the 
Smad2 (or Smad3) MHZ domain. For example, in order to detect FAST- 
1/Smad interactions, the FAST-1 SID domain is fused to glutathione S- 
transferase (GST) by subcloning the FAST-1 SID-cncoding DNA fragment into 
a bacterial expression vector that encodes a GST tag. Such vectors are well 

25 known in the art and are commercially available (e.g., the pGEX fusion vectors 
commercially available from Pharmacia). GST-tagged FAST-1 SID fusion 
protein is produced by transforming the GST-FAST- 1 SID-encoding vector 
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into E. coli bacteria. Fusion proteins are then purified by allowing the proteins 
from lysed bacteria to bind to glutathione sepharose-coated beads. The GST- 
tagged FAST-1 SID-bearing beads are then used to specifically bind Myc- 
tagged Smad2 (or Smad3) MH2 domains polypeptides produced in Xenopus 
5 embryos. Detection of FAST-1 /Smad2 (or Smad3) interactions are assessed by 
resolving the glutathione-immobilized proteins by Laemmli gel electrophoresis 
and subjecting the resolved proteins to Western blot analysis using anti-Myc 
antibodies. 

In order to detect compounds that inhibit TGF-P superfamily 

10 signaling by disrupting FAST-1/Smad2 (or Smad3) interactions, Xenopus 
embryo ly sates containing Myc-tagged Smad2 (or Smad3) MH2 domain 
polypeptides are incubated with a candidate TGF-p signaling modulatory 
compound prior to the incubation with glutathione Sepharose-coated beads 
carrying the GST-tagged FAST-1 SID. Glutathione-immobilized proteins from 

1 5 treated vs. untreated Xenopus embryo lysates are then subjected to Western 
blotting with anti-Myc antibodies. A difference in the amount of anti-Myc 
reactivity of the glutathione-immobilized proteins from treated samples vs. 
untreated samples indicates that the test compound modulates TGF-P 
superfamily-mediated signal transduction. 

20 GST-tagged FAST-1 SID fusion proteins may be immobilized on a 

solid-state substance for rapid high-throughput identification of compounds that 
modulate TGF-P superfamily signaling. Preferably, the solid-state substance is 
the bottom of a well on a 96-well (or similar) plate. Each well may then be 
provided with a known amount of the MH2 domain of either Smad2 or Smad3 

25 that is tagged with a readily detectable epitope (e.g., an short polypeptide 
fragment, e.g., HA or Myc, that is specifically recognized by an antibody). 
Preferably, a Smad2 or Smad3 MH2 domain tagged with the alkaline 
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phosphatase (AP) enzyme is added to each GST-tagged FAST-1 SID-bearing 
well. Candidate compounds to be screened for an ability to modulate TGF-P 
superfamily signaling are then added individually or in combination to each 
well on the plate. After allowing the interaction of the components in each 

5 well, the plate is washed, and the substrate for AP is added to each well. A 
compound that modulates TGF-p superfamily signaling may affect the binding 
affinities of the FAST-1 SID and the Smad2 or Smad3 MH2 such that the 
amount of bound Smad MH2, and hence, bound AP enzymatic activity, is 
altered. Preferable AP substrates are colorimetric substrates, such as the nitro 

10 blue tetrazolium (NBT) and 5-bromo-l-chloro-3-indolyl-phosphate (BCIP) 
reagents that are commercially available (e.g., from Promega). 

After allowing formation of the blue/black precipitate to occur in a 
control well that has not been treated with a candidate compound, the plate is 
quantitated for color intensity on a 96-well plate reader. A compound that 

1 5 affects the color intensity of AP substrates when added to a well, as compared 
to a well not treated with a compound, indicates a compound that has the ability 
to modulate TGF-P superfamily mediated signal transduction. 

Molecules that are found to effectively modulate TGF-P superfamily 
signaling, using the methods described above, may be further tested using in 

20 vivo animal models. Compounds that function effectively in an in vivo setting 
may be used as therapeutics to either inhibit or enhance TGF~p superfamily 
member-mediated signaling, as appropriate. 

IV. Administration of Modulators of TGF-p Superfamily Signal Transduction 
A TGF-P superfamily signaling modulator may be administered 
25 within a pharmaceutically-acceptable diluent, carrier, or excipient, in unit 
dosage form. Conventional pharmaceutical practice may be employed to 
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provide suitable formulations or compositions to administer a TGF-p 
superfamily signaling modulator(s) to patients suffering from a disease (e.g., a 
developmental disease) that is caused by an abnormal amount of TGF-P 
superfamily member-mediated signal transduction. Administration may begin 
5 before the patient is symptomatic. Any appropriate route of administration may 
be employed, for example, administration may be parenteral, intravenous, intra- 
arterial, subcutaneous, intramuscular, intracranial, intraorbital, ophthalmic, 
intraventricular, intracapsular, intraspinal, intracisternal, intraperitoneal, 
intranasal, aerosol, by suppositories, or oral administration. Therapeutic 

1 0 formulations may be in the form of liquid solutions or suspensions; for oral 
administration, formulations may be in the form of tablets or capsules; and for 
intranasal formulations, in the form of powders, nasal drops, or aerosols. 

Methods well known in the art for making formulations are found, 
for example, in "Remington's Pharmaceutical Sciences." Formulations for 

1 5 parenteral administration may, for example, contain excipients, sterile water, or 
saline, polyalkylene glycols such as polyethylene glycol, oils of vegetable 
origin, or hydrogenated napthalenes. Biocompatible, biodegradable lactide 
polymer, lactide/glycolide copolymer, or polyoxyethylene-polyoxypropylcne 
copolymers may be used to control the release of the compounds. Other 

20 potentially useful parenteral delivery systems for TGF-P superfamily signaling 
modulatory compounds include ethylene-vinyl acetate copolymer particles, 
osmotic pumps, implantable infusion systems, and liposomes. Formulations for 
inhalation may contain excipients, for example, lactose, or may be aqueous 
solutions containing, for example, polyoxyethylene-9-lauryl ether, glycocholate 

25 and deoxycholate, or may be oily solutions for administration in the form of 
nasal drops, or as a gel. 

Dosage is determined by standard techniques and is dependent, for 
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example, upon the weight of the patient and the type or extent of disorder being 
treated. 



V. Diagnostics for Disorders involving Abnormal TGF-B Superfamilv 
Signaling 

5 To determine whether an individual either has or is likely to develop 

a disorder (e.g., a developmental disorder) involving abnormal TGF-p 
superfamily signaling, that individual may be screened for mutations in the 
domains (e.g., the SID of FAST- 1 or the MH2 domains of Smad2 and Smad3) 
of the genes encoding FAST-1 , Smad2, and Smad3 that mediate the binding 

10 interactions of FAST-1 with Smad2 or Smad3. Screening for mutations may be 
carried out using any standard technique including, without limitation, methods 
involving sequencing, or mismatch binding or cleaving assays. For example, a 
nucleic acid sample may be derived from cells of an individual to be tested for 
a mutation (for example, by PCR amplification), and the FAST-1, Smad2, and 

15 Smad3 genes may be subjected to rapid sequence analysis by automated 
sequencing techniques using primers generated from FAST-1, Smad2. and 
Smad3 sequences described in the art and herein. 

Alternatively, an individual who either has or is likely to develop a 
disorder involving abnormal TGF-p superfamily signaling may be screened for 

20 altered expression of FAST-1, Smad2 or Smad3. Such assays may be carried 
out, for example, using any standard nucleic acid-based assay (e.g., Northern 
blot analysis) or immunological assay (e.g., enzyme-linked immunosorbent 
assay (ELISA)), preferably in a high through-put assay format. For example, 
cells may be obtained from an individual to be tested, and analyzed by ELISA 

25 for the expression of FAST-1 , Smad2, or Smad3 proteins, using as probes, 

fluorophore-tagged antibodies directed against these proteins. Individuals that 
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have altered protein levels relative to the general population, are readily 
identified using such ELISA-based assays. 

VI. FAST-1 Related Genes 

Standard techniques, such as the polymerase chain reaction (PCR) 
5 and DNA hybridization, may be used to clone additional FAST-1 homologues 
in other species. In order to detect such homologues, genomic DNA of various 
organisms (e.g., humans or mice) may be analyzed by Southern blotting using 
nucleic acid probes generated from the nucleic acid sequences encoding 
Xenopus FAST-1. Hybridization at low stringency should reveal bands that 

10 correspond to DNA encoding FAST-1 and/or related family members. 

Xenopus FAST-1 nucleic acid probes may be based upon the codon preference 
of the organism, whose DNA is under analysis, or they may be degenerate 
probes based upon all possible codon combinations, or they may be a 
combination of codon preference and codon degeneracy. Such probes may also 

15 be used to screen either genomic or cDNA libraries for sequences that 
hybridize to the probe. FAST-1 nucleic acid probes also may be used as 
primers to clone additional FAST-1 related genes by RT-PCR, using methods 
known in the art. 

Another method for identifying mammalian homologues of the 

20 FAST- 1 is by searching publically available databases for sequences that share 
sequence identity with the Xenopus FAST-1 nucleic acid or amino acid 
sequence (Genbank accession number U70980), or with sequence fragments 
thereof. A particularly preferred FAST-1 sequence fragment is the sequence 
corresponding to the Smad Interaction Domain (SID) of FAST-1 . Once 

25 identified, a candidate mammalian homologue of FAST-1 (or polypeptide 

fragment thereof) may be tested for FAST-1 -like activity (e.g., ability to bind 
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the Srnad2 or Smad3 MH2 domain), using the assays described herein. 

The following examples are meant to illustrate the invention. They 
are not meant to limit the invention in any way. 



EXAMPLE I 

5 Smad4 fDPC4) is a component of ARK 

Signaling by TGF-P superfamily members induces transcriptional 
activation of target genes. Some of these transcriptional responses are 
necessary and sufficient for the specification or patterning of mesoderm. 
Several TGF-P superfamily-responsive genes show immediate-early responses, 

10 ie., they are induced even when translation is inhibited by cycloheximide. Two 
such genes, Mix.l and Mix. 2, are transcriptionally activated by signals initiated 
by TGF-P superfamily members such as activin, Vg-1 , TGF-p, and BMP4. In 
contrast, Mix.l and Mix.2 are not transcriptionally activated by non-TGF-P 
mesoderm inducers or axial modifiers. 

15 An activin-responsive factor (ARF) was identified using an 

electrophoretic mobility-shift assay (EMSA) for embryonic proteins that bind 
to the Mix.2 promoter elements. The ARF, which is induced in embryonic 
blastomeres after 5-30 minutes of activin stimulation, binds specifically to a 50- 
bp Mix.2 promoter element. FAST-1 was identified as the major DNA-binding 

20 component of the ARF complex. 

Smad2, which associates in a ligand regulated manner with another 
member of the Smad family, Smad4 (DPC4), is a compound of the ARF 
complex. Therefore, we asked whether Smad4 is also a component of ARF. 
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Methods 

HA-tagged Smad4 was provided by Akiko Hata and Joan Massague, 
and the untagged full length Smad4 construct was provided by P. Hoodless and 
J. Wrana. We have previously described. the Xenopus activin encoding 
5 construct (Thomsen et ah, Cell 63: 485-493). These constructs were in vitro 
transcribed according to standard techniques described in, for example, Krieg 
and Melton (Meth. Enzymol. 155: 397-415, 1987). 

Xenopus laevis embryos at the 2-cell stage were microinjected in 
both blastomeres with 0.5-2 ng of RNA encoding HA-tagged or untagged 

10 Smad4 (DPC4) with or without RNA encoding activin, as indicated in Fig. 2. 
Embryos were maintained in IX MMR containing 3% Ficoll during 
microinjection, after which embryos were transferred to 0.1X MMR. Embryos 
were harvested for EMSA lysates at Stage 9 as previously described (Huang et 
al., EMBO J 14:5965-5973, 1995). Staging of embryos was done according to 

1 5 Nieuwkoop and Faber ( Normal Table of Xenopus laevis (Daudinl . Second 
edition ed. North Holland Publishing Company, Amsterdam. 1967). 

EMSA was performed as previously described (Huang et al., supra) 
using as a probe the 32 P-labeled ARE from the Mix.2 promoter (Chen, et al., 
Nature 383:691-696, 1996). For supershift assays, EMSA assay mixtures were 

20 incubated with anti-HA antibody (commercially available from (iibco-BRJL) 
for 1 hour on ice prior to SDS-PAGE and autoradiography. 

Results 

Fig. 2 shows a supershifted electrophoretic mobility-shift assay 
(EMSA) demonstrating the incorporation of Smad4 into the ARF complex. 
25 HA-tagged (lanes 2, 3, 7, and 8) or untagged (lanes 5 and 6) Smad4 was 

expressed in early Xenopus embryos and incorporation of HA-tagged protein 
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into ARF was tested by co-incubation of EMSA mixtures with anti-HA 
antibody (lanes 3, 6, and 8). "Activin" indicates samples in which activin was 
co-expressed. "ssARF" (super-shifted ARF) indicates increased mobility of the 
anti-HA antibody-bound ARF. 

5 The EMSA experiment of Fig. 2 shows that Smad4 (DPC4) is a 

component of the ARF complex. A supershift of the ARF complex by anti-HA 
antibody is dependent upon the presence of HA-Smad4 within the embryo 
lysate (Fig. 8, lane 2). However, overexpression of Smad4 in the absence of 
stimulation by activin is not sufficient for ARF formation, since supershifts 

10 were detected only in embryos co-injected with HA-Smad4 RNA plus activin 
RNA, but not in embryos injected with HA-Smad4 RNA alone. Hence, the 
binding of ligand (in this case, activin) to a TGF-P superfamily receptor 
appears to provide additional signals that are necessary for ARF formation. 



EXAMPLE II 

15 Smad4 TDPC4) and Smad2 co-associate in ARF complexes . 

Incorporation of Smad4 (DPC4) into ARF might reflect the co- 
association of Smad2 ? Smad4, and FAST-1 within the same complex. 
Alternatively, there might be two types of ARF: a Smad2-containing ARF, 
which would predominate in the presence of overexpressed Smad2, and a 

20 Smad4-containing ARF, which would predominate in the presence of 

overexpressed Smad4. We next determined which of these two models was 
correct. 



25 



Methods 

Xenopus Smad2 (provided by J. Graff and D. Melton) carrying six 
consecutive Myc epitope tags at the Smad2 N -terminus was generated by 
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cloning the Smad2 coding region into the fusion vector pCS2(+)MT, which 
encodes the Myc tags (Thomsen et ah, Cell 63: 485-493, 1990; Turner and 
Weintraub, Gen. and Dev. 8; 1434-1447, 1994). Smad3 was similarly N- 
terminally fused to six Myc tags. 
5 RNAs encoding Myc-tagged Smad2, HA-tagged Smad4, and activin 

were co-injected into two-cell Xenopus laevis embryos according to the method 
described in Example I. Stage 9 embryos were harvested and assayed by 
supershift-EMSA with anti-Myc or anti-HA antibodies as described in Example 
I. 



10 Results 

Fig. 3 shows a supershifted EMSA demonstrating the presence of 
Myc-tagged Smad2 and HA-tagged Smad4 within the same ARF complex. 
ARF complexes that were supershifted using only anti-HA antibody are 
designated "HA-ssARF", those using only anti-Myc antibody, "Myc-ssARF", 

1 5 and those that were supershifted using both antibodies are designated 
HA+Myc-ssARF. 

The addition of both anti-HA and anti-Myc antibodies resulted in a 
more highly supershifted ARF, relative to supershifted ARFs produced by 
either antibody alone. This result strongly suggests that Smad2 and Smad4 co- 

20 exist within the same ARF complex, rather than within two discrete subsets of 
ARF complexes. We obtained identical results Myc- tagged Smad3 in place of 
Myc-tagged Smad2. Consistent with these results, we observed that the 
simultaneous addition of anti-HA and anti-Myc antibodies to lysates from 
embryos expressing HA-tagged Smad4 plus untagged Smad2, or Myc-tagged 

25 Smad2 plus untagged Smad4, resulted in supershifted complexes analogous to 
those produced by using only one anti-epitope antibody. 
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EXAMPLE III 

The Smad2 MH2 domain alone can be incorporated into the ARF complex 

The Smad2 Mad Homology 2 (MH2) domain is necessary for 
Smad2 -dependent transcriptional activation. In order to determine whether the 
5 MH2 domain is also necessary for incorporation of Smad2 into the ARF, we 
used supershift-EMS A to ask whether the Smad2 MH2 domain alone could be 
incorporated into ARF complexes. 

Methods 

The Smad2 MH2 domain was tagged with the FLAG epitope by 
PCR- amplifying a DNA sequence encoding the Smad2 MH2 domain (Smad2 
amino acids 248-467) and subcloning the PCR product into the pCS2+ vector 
(previously described by Thornsen et al., Cell 63: 485-493, 1 990; and Turner 
and Weintraub, Gen. and Dev. 8; 1434-1447, 1994). 

FLAG-taggcd Smad2 MH2, HA-tagged Smad4, and activin were co- 
expressed in two-cell Xenopus laevis embryos according to the method 
described in Example I. Stage 9 embryos were harvested and assayed for 
incorporation into the ARF by supershift-EMSA using anti-FLAG or anti-HA 
antibodies, as described in Example 1. 

Results 

20 Fig. 4 shows a supcrshifted EMS A demonstrating that the Smad2 

MH2 domain alone can be incorporated into the ARF. ARF complexes that 
have increased mobility due to the incorporation of Smad2-MH2 (rather than 
full-length Smad2) are indicated by "MH2-ARF", FLAG-Smad2-MH2-ARF 
complexes supershifted with anti-FLAG are indicated by 



10 
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"FLAG-ssMH2ARF", and Smad2-MH2 -ARF complexes supershifted by 
anti-HA (recognizing Smad4 in the complex) are indicated by 
"HA-ssMH2ARF\ 

The Smad2 MH2 domain was part of a complex that bound ARE, but 
5 that migrated more rapidly than did endogenous ARF (presumably due to the 
reduced molecular weight of complexes containing the Smad2 MH2 domain as 
opposed to full-length Smad2 (Fig. 4). Incorporation of the Smad2 MH2 
domain into ARF was activin-dependent (Fig. 4, lanes 2 and 4). ARF 
complexes that contain the Smad2 MH2 domain are supershifted by anti-HA 
10 antibody, indicating that these complexes also contain HA-taggcd Smad4 (Fig. 
4, Lane 9). 



EXAMPLE IV 

Co-immunoprecipitation of FAST-1 with Smad2 

In the experiments descnbed in the previous examples, EMSA was 
1 5 used to detect the binding of ARF to its DNA target, ARE. To study 

FAST-1/Smad2 interactions in the absence of ARF/ARE complex formation, 
we asked whether FAST-1 and Smad2 could be co-immunoprecipitated from 
Xenopus embryo lysates. 

Methods 

20 Myc-tagged Smadl was generated by subcloning the sequences 

encoding Smadl from the FLAG-tagged Smadl construct into the pCS2(+)MT 
vector, which is a modification of the pCS2+ vector originally described in 
Thomsen et al., supra and Turner and Weintraub, supra. FAST-1 was tagged 
by N-terminal fusion at amino acid 61 with GST. 
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Xenopus laevis embryos at the 2-cell stage were co-injected either 
with RNA encoding GST-tagged FAST-1 plus RNA encoding Myc-tagged 
Smadl, or with RNA encoding GST-tagged FAST-1 plus RNA encoding Myc- 
tagged Smad2, both plus or minus co-injection of RNA encoding activin. 
5 Embryos were harvested at Stage 9 in lysate buffer (as described in 

Example I), and cleared by centrifiigation for 1 5 minutes at 32,000x g. Cleared 
ly sates were immunoprecipitated with anti-GST tag antibody for 1 hr at 4°C, 
and then incubated with protein A-sepharose for 30 min. The beads were then 
washed under the following conditions: IX lysate buffer 0.1% NP40, IX lysate 

10 buffer + 0.4M NaCl, IX lysate buffer + 0.5% NP40, IX lysate buffer + 0.2M 
NaCl, 0.25%NP40, IX lysate buffer. Samples were fractionated by 
electrophoresis and transferred to nitrocellulose. The nitrocellulose- 
immobilized immunoprecipitates were blotted with anti-Myc antibody and 
immunoreactive bands were detected by ECL as previously described (sec 

15 LaBonne et al., Development 121: 1475-1486, 1995). In parallel, whole- 
embryo lysates were also subjected to electrophoresis, transferred to 
nitrocellulose, blotted with anti-Myc antibody, and subjected to ECL detection. 

Results 

As shown in Fig. 5A, Myc-tagged Smad2 co-immunoprecipitated 
20 with GST-tagged FAST-1 , and Smad2/FAST- 1 complex formation was 

enhanced by activin stimulation (lanes 6 and 7). The anti-GST antibody did not 
immunoprecipitate Myc-tagged Smad2 from lysates of embryos not expressing 
GST-tagged FAST-1, indicating that Smad2 immunoprecipitation resulted from 
its specifically interacting with FAST-1. 
25 In contrast to Smad2 and FAST- 1 , co-expression of tagged Smad 1 

and FAST- 1 did not lead to detectable co-immunoprecipitation of Smad-1 (Fig. 
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5 A, lanes 3 and 4). Equivalent expression of Smadl and Smad2 in embryos 
was confirmed by Western analysis of whole embryonic lysate (Fig. 5 A, lower 
panel). Therefore, it appears that Smadl and FAST-1 do not directly interact, 
or do not interact as strongly as do Smad2 and FAST-1 . 

5 Ligand-induced activation of TGF-P superfamily receptors is 

apparently not a prerequisite for Smad2/FAST-1 complex formation: Myc- 
tagged Smad2 co-immunoprecipitated with FAST-1 even in the absence of 
stimulation by activin, although Smad2/FAST-1 complex formation increased 
in lysates from embryos injected with activin mRNA. At high levels of FAST- 

10 1 expression, Smad2 co-precipitation was nearly equivalent in lysates from 
activin-stimulated and unstimulated embryos (similar results were obtained 
using Myc-tagged Smad3 in place of Smad2, or by first immunoprecipitating 
Smad2 and then detecting co-immunoprecipitated FAST-1 by Western blot 
analysis). 

1 5 We were surprised to observe the activin-independent co- 

precipitation of Myc-Smad2 and GST-FAST-1, since EMSA experiments 
described in previous examples showed that supershifting of ARF/ARE 
complexes by anti-Myc (i.e. Myc-Smad2) antibody was activin-dependent. 
This result suggests that in addition to the activin-dependent formation of 

20 FAST-l/Smad2-containing complexes that are competent for DNA binding, 
there exist activin-dependent complexes that are either not competent for DNA 
binding, or not stable under our EMSA assay conditions. 

EXAMPLE V 

Co-precipitation of FAST-1 with Smad4 (DPCA) 
25 In order to study the nature of FAST-1 /Smad4 (DPC4) 

interacterations prior to ARE/ARF complex formation, we asked whether 
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Smad4 and FAST-1 could be co-immunoprecipitated from activin-stimulated 
and unstimulated Xenopus embryo lysates. 

Methods 

Myc-tagged Smad4 was constructed by cloning the full-length 
5 Smad4 into the pCS2(+)MT vector, which is described in Example II. 

Myc-tagged Smad4 was co-expressed with GST-tagged FAST-1 in 
Xenopus embryos in the presence or absence of activin stimulation. The 
microinjected embryos were lysed, GST-tagged FAST-1 was 
immunoprecipitated with anti-GST antibody, and immunoprecipitates were 
10 subjected to Western blot analysis using anti-Myc antibody, as described in 
Example IV. 



Results 

Myc-tagged Smad4 (DPC4) was co-immunoprecipitated with GST- 
tagged FAST-1 from lysates of activin-stimulated embryos. However, such 

15 Smad4/FAST-1 complexes were not evident above background in lysates from 
unstimulated embryos (Fig. 5B). Identical results were obtained by 
immunoprccipitating Myc-tagged Smad4 and performing immunoblots using 
an anti-GST (GST-FAST-1) antibody, or by substituting FLAG-tagged FAST-1 
for GST-tagged FAST-1 . Hence, the association of Smad4 with FAST-1 

20 requires prior activin stimulation. 



EXAMPLE VI 

Deletion Analysis of FAST-1 

FAST-1 contains a predicted winged helix DNA binding domain, but 
has no extensive homologies to other winged helix factors or other known 
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proteins outside the DNA binding domain. To identify the regions of FAST- 1 
that are important for its incorporation into ARF, we expressed epitope-tagged 
deletion mutants of FAST- 1 in early embryos and tested them for incorporation 
into ARF by antibody supershift-EMSA. 

5 Methods 

FAST-1 was tagged by N-terminal fusion at amino acid 61 with 6 
Myc tags, by cloning FAST-1 cDNA encoding amino acids 61 to 534 into the 
pCS2(+)MT vector (see Example II) to generate the Myc-tagged 61-534 FAST 
construct. The Myc-tagged A 1-366 FAST-1 construct was generated by 
10 cloning FAST-1 cDNA encoding amino acids 366 to 534 into the pCS2(+)MT 
vector. 

Various deletion mutants of FAST-1 were constructed from the Myc- 
tagged FAST 61-534: Myc-tagged A516-534; Myc-tagged A506-534; Myc- 
tagged A473-534; Myc-tagged A281-366; Myc-tagged A366-380; Myc-tagged 

15 A366-407; Myc-tagged A207-453; Myc-tagged A366-473; Myc-tagged A366- 
534; Myc-tagged A380-407; and Myc-tagged A453-506. Messenger RN As 
encoding the Myc-tagged FAST-1 deletion mutants plus mRNA encoding 
activin were microinjected into two-cell Xenopus laevis embryos. Stage 9 
embryos were harvested for EMSA as described in Example I, and the ability 

20 of anti-Myc antibodies to supershift ARF/ ARE complexes was assessed. 

Messenger RNAs encoding FAST-1 deletion mutants also were co- 
microinjected either with RNA encoding Myc-tagged Smad2 plus or minus 
RNA encoding activin, or with RNA encoding HA-taggcd Smad4 (DPC4) plus 
RNA encoding activin, and EMSA lysates were prepared from stage 9 

25 embryos. ARF/ARE complexes that were supershifted by anti-Myc- or anti- 
HA-specific antibodies indicated FAST-1 deletion mutants that retained the 



WO 98/53830 



PCT/US98/10983 



-36- 

ability to associate with Smad2 or Smad4, respectively. 
Results 

In order to determine which regions of FAST- 1 interact with Smad2 
and Smad4 and which are necessary for incorporation into ARF and for 
5 ARF/ARE complex formation, mRNAs encoding epitope-tagged FAST-1 
deletion mutants were co-expressed with tagged Smad2 or Smad4 plus or 
minus activin. The summarized results of these experiments are shown in Fig. 
6A, and FAST-1 polypeptide domains that are necessary for interactions with 
Smads and for ARF/ARE complex formation are shown in Fig. 6B (TAG=Myc 
10 tag; amino acids 108-219 delineate the winged helix domain). 

Deletions N-terminal to the forkhead domain (N-terminal to FAST-1 
amino acid 107) do not appear to reduce incorporation of FAST-1 into ARF. 
Moreover, deletion of the N-terminal two-thirds of FAST-1 (up to amino acid 
365), including the entire winged helix domain, does not reduce ligand- 
15 dependent association of FAST-1 with Smad2 or Smad4 and, hence, does not 
reduce incorporation of FAST-1 into ARF. 

However, as shown in Fig 6A, FAST-1 deletion mutants that lack the 
winged helix DNA-binding domain arc not incorporated into ARF/ARH 
complexes. These data show that the winged helix domain is not necessary for 
20 FAST-l/Smad interactions and for FAST-1 incorporation into the ARF, but is 
necessary for ARF/ARE complex formation. 

Deletions from the N-terminal side of the C-terminal half of FAST-1 , 
up to amino acid 380 (e.g., A 281-366), also allowed association of FAST-1 
with Smad2 or Smad4 in an activin-dependent manner, as did a 29-amino acid 
25 C-terminal deletion. However, deletion of an additional 33 C-termina! amino 
acids prevented association of FAST-1 with Smad2 or Smad4. Although 
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deletions of the C-terminus beyond the C-terminal 29 amino acids prevents 
FAST-1 incorporation into ARF, such deletions do not affect the ARE-binding 
activity of FAST-1 itself. 

The FAST-1 domain responsible for co-precipitation with Smad2 or 
5 Smad4 localizes to a 126 amino acid C-terminal domain (380-506); this domain 
also is necessary for incorporation of FAST-1 into ARF/ARE complexes. We 
call this region of FAST-1 the Smad Interaction Domain (SID). Additional 
FAST-1 deletions (A207-453, A506-518, and A473-518) allowed us to delimit 
the region necessary for activin-dependent association of FAST-1 and Smad2 

10 to amino acids 453-506. Experiments using a construct with a deletion 
(A366-473) from the N-terminal side of the SID showed that amino acids 
C-terminal to position 473 are sufficient for reduced, but still significant, 
activin-dependent association of FAST-1 with Smad2, but are insufficient for 
mediating interactions with Smad4, or for ARF/ARE complex formation. 

15 Companson of the regions of FAST-1 necessary for ARF/ARE 

formation with those regions necessary for co-immunoprecipitation with 
Smad2 and with Smad4 revealed a subregion of the FAST-1 SID (amino acids 
380-407) that was necessary for activin-dependent ARF/ARE complex 
formation, but not necessary tor the co-precipitation of Smad2 with FAST-1. 

20 How r ever, this region was necessary for the co-immunoprecipitation of Smad4 
with FAST-1. This finding, in combination with the observation that there 
were no deletion mutants of FAST-1 that co-immunoprecipitatcd with Smad4 
but not with Smad2 in an activin-independent manner, suggests that Smad2 and 
FAST-1 initially interact in an activin-indcpcndcnt manner (i.e., prior to 

25 engagement of the TGF-p receptor by ligand), and that activin-stimulated 

phosphorylation of the Smad2 C-terminus enhances the association between the 
Smad2 MH2 domain and the region of FAST-1 encompassing amino acids 
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453-506, as determined from experiments using mutants A366-407, A380-407, 

and A207-453. This activin-dependent step allows Smad4 to interact with 

Smad2andFAST-l. 

Although FAST-1/Smad2 interaction occurs in the absence of Smad4 
5 binding of the FAST-1/Smad2 complex to the ARE is not observed for FAST- 1 

mutants that are unable to bind Smad4. Hence, interaction among Smad4, 

Smad2, and a FAST-1 domain encompassing amino acids 380-407 is necessary 

for the formation of stable ARF/ARE complexes. 

To summarize, the FAST-1 SID appears to possess two subdomains 
10 that mediate the sequential association of FAST-1 with Smad2. which then 

associate with Smad4; ARF complexes must contain these three factors in order 

to stably bind the ARE. 

EXAMPLE VII 

Yeast-Two Hybrid Interaction of FAST-1 with Smads 

15 The activin-stimulated co-precipitation of FAST-1 with Smad2 and 

Smad4 demonstrated that these polypeptides are able to form a complex in the 
absence of the ARE DNA target, but did not address whether additional 
components of the activin signaling pathway are necessary for complex 
formation. In order to directly examine the physical interactions among Smad2, 

20 Smad4. and FAST-1, we tested these proteins for association in a yeast 

interaction trap system (Fields, supra). Portions of FAST-1 or Smad4 cloned 
into a GAL4 DNA binding domain fusion vector (pGBT9) were tested for their 
ability to interact with various Smad-GAL4 activator domain (pG AD424) 
fusions. 
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Methods 

Interaction trap constructs: Truncated derivatives of FAST-I and 
Smad genes were cloned into the shuttle/expression vectors pGBT9 and 
pGAD424 (or pGADIO) (Bartel et ah, Using the two-hybrid system to detect 
5 protein- protein interactions , pi 53- 179. In D. Hartley (ed.), Cellular 

Interactions in Development: A Practical Approach . Oxford Press, Oxford). A 
fusion of the GAL4-DNA-binding domain in the pGBT9 vector with each 
FAST-1 truncation derivative was generated, as was a fusion of the GAL4 
activation domain in the pGAD424/pGAD 1 0 vector with each Smad. 

10 Specifically, FAST-1-GAL4 DNA-binding domain fusion proteins in 

the pGBT9 vector included the following regions of FAST- 1 ; ( 1 ) FAST- 1 
N-domain and C-domain (aa61 to aa516); (2) FAST-1 C-termmus (aa366 to 
aa518); (3) FAST-1 A207-453 (aa61 to aa515, with 6 Myc epitope tags at the 
junction with the GAL4-DNA-binding domain, with aa207 to aa453 deleted); 

15 (4) FAST-1 A366-407 (aa61 to aa515, with aa366 to aa407 deleted); (5) 

FAST-1 A366-439 (aa61 to aa515, with aa366 to aa439 deleted); (6) FAST-1 
forkhead domain (aa56 to aa365). 

Smad-GAL4 activation domain fusion proteins in the pGAD424 or 
pGADIO vector were generated that included the following Smad regions: (1) 

20 Xenopus Smad2 MH2 domain (aa248 to aa467); (2) human Smadl MH2 
domain (aa249 to aa465); (3) full-length mouse Smad4 (aal to aa548); (4) 
mouse Smad4 MH2 domain (aa306 to aa548). In addition, full-length Smad4 
(aal to aa548) and the MH2 domain of Smad4 (aa306 to aa548) were cloned 
into the pGBT9 vector. 

25 Transformation and testing of yeast with two-hybrid clones: Yeast 

transformations, colony lift filter assays were carried out according to the 
MATCHMAKER Two-Hybrid System protocol (Clontech Laboratories, Inc., 
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Palo Alto, CA). For the filter assay, colony color was periodically observed 
during a 5-6 hour incubation at 30°C following initial exposure of 
permeabilized yeast to the Z buffer/X-gal solution. The liquid culture 
beta-galactosidase assay was performed according to the MATCHMAKER 
5 Two-Hybrid System protocol (Clontech Laboratories, Inc., Palo Alto, CA). 
O-nitrophenyl beta-D-galactopyranoside (ONPG) was used as a substrate in 
this assay. Beta-galactosidase units corresponding to each sample were 
calculated using the following equation: Beta galactosidase units = (1000 X 
OD 420 ) + (tV X OD 600 ) where: t= elapsed time (in minutes) of incubation. V= 

10 0. 1 ml X concentration factor of 5, OD 600 = Absorbance at 600 nm of 300 
microliters of Z buffer-washed and resuspended culture. 

Positive results were measured either as the development of blue 
color on X-Gal filter lifts of colonies expressing both activator and DNA 
binding domain constructs relative to colonies expressing each construct alone, 

15 or as a ratio of P-galactosidase activity in liquid cultures expressing activator 
and DNA binding domain constructs relative to colonies containing the DNA 
binding domain construct alone. 

Results 

As shown in Table 1 , the C-terminal third of FAST- 1 , to which the 
20 Smad2 co-immunoprecipitation function of FAST- 1 mapped (FAST-1 

366-518), interacted strongly with the MH2 domain of Smad2, whereas the 
winged helix domain region (FAST-1 56-365), did not. The Smad2 Ml 12 
domain, and Smad4 itself (in pGAD424 ? the activator domain construct) 
interacted with Smad4 when Smad4 was expressed in pGBT9 (DNA binding 
25 domain construct), confirming that the activator domain-Smad2 and -Smad4 
fusion proteins were expressed, and that these proteins physically interact 
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within the yeast assay. In contrast, the C-terminus of FAST- 1 did not interact 
detectably with the MH2 domain of Smadl, confirming the specificity of its 
interaction with the Smad2 MH2 domain. Nor did the FAST-1 C-terminus 
interact detectably with the MH2 domain of Smad4. 
5 Additional N-terminal deletions of the C-terminal third of FAST-1 

which allowed us to distinguish regions necessary for ARF/ARE complex 
formation and Smad4 association from those necessary for Smad2 
co-immunoprecipitation (i.e., FAST-1 A366-407), showed that the region of 
FAST-1 necessary for its interaction with Smad2 in yeast was similar to the 
10 FAST-1 region necessary for FAST-1/Smad2 co-immunoprecipitation. 

Although our results from the yeast interaction trap assay the possibility that 
additional proteins enhance Smad4/FAST-1 interactions (for example, yeast 
lack homologues for the activin signaling pathway), our results indicate that 
activin signaling is not a prerequisite for Smad2/FAST-1 interaction. 
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Tabje 1 

Interactions of Smads with Themselves and FAST-1 



Bait Construct Interactor Construct Color Intensity 

(Filters) 

5 



25 



FAST-I 
(aa61-518) 


Smad2 (MH2) 
Smadl (MH2) 
Smad4 (Full Length) 
omau4 (NiriJ.) 


+ 


FAST-1 
(aa366-518) 


Smad^ (MH2^ 
Smadl (MH2) 
Smad4 (Full Length) 
Smad4 (MH2) 


+ 


FAST-I 

(aa56-365: A366-518) 


Smad2 (MH2) 
Smadl (MH2) 




FAST-I 

(aa61-515: A366-407) 


Smad2 (MH2) 
Smadl (MH2) 


+ 


FAST-I 

(aa61-515:A366-439) 


Smad2 (MH2) 
Smadl (MH2) 


+ 


FAST-1 

(aa61-515: A207-453) 


Smad2 (MH2) 
Smadl (MH2) 


+ 



Smad4 (FL) Smad2 (MH2) + 

30 Smadl (MH2) -i 

Smad4 (Full Length) + 

Smad2 (MH2) Smad4 (MH2) +~ 
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EXAMPLE VIII 

Overexpressed FAST-1 SID d omains block s formation of the ARF 

The identification of a domain in FAST-1 that is necessary for 
interaction with Smads raised the possibility of using this domain to 
5 competitively inhibit activin signaling. Embryos were injected with mRNA 
encoding amino acids 366-518 of FAST-1 (the C-terminal third of FAST-1, 
containing the SID) and tested for their ability to respond to activin signals. 
We first asked whether the FAST-1 SID could inhibit activin-dependent 
ARF/ARE complex formation. 

10 Methods 

Two ng of mRNA encoding FAST-1 amino acids 366 to 51 8, plus or 
minus mRNA encoding activin, was co-injected into both blastomeres of 
two-cell Xenopus laevis embryos by the method described in Example I. Stage 
9 embryos were harvested and tested for ARF/ARE complex formation by 
1 5 EMS A as described in Example I. 

Results 

Control embryos (Fig. 7, lanes 1, 2) and embryos injected with 
mRNA encoding FAST-1 366-518 (Fig. 7, lanes 3, 4) plus (Fig. 7, lanes 2, 3) 
or minus (Fig. 7, lanes 1, 4) activin stimulation were harvested at Stage 9 and 
20 lysates were tested for ARF/ARE complex formation by EMSA. As shown in 
Fig. 7, overexpression of the Smad Interaction Domain of FAST-1 inhibited the 
formation of ARF/ARE complexes. 
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EXAMPLE IX 

Qverexpression of FAST- 1 SID Blocks Brachyury Induction and Animal Cap 
Induction bv Activin 

The activin/TGFp superfamily induces mesoderm formation in early 
5 Xenopus embryos. Induction of mesoderm may be inferred by the detection of 
brachyury, a marker of early mesoderm, and by animal cap elongation. 

We tested the effect of FAST SID overexpression upon activin- 
induced brachyury expression, and upon activin-induced animal cap elongation. 



Methods 

10 Animal cap assays and RT-PCR: Two ng of mRNA encoding the FAST-1 SID 
(FAST-1 ammo acids 366 to 518) and/or 150 pg Smad2 RNA were 
microinjected into both blastomeres of two-cell Xenopus laevis embryos by the 
method described in Example I. Animal caps were cut from Stage 8-9 blastulae 
and cultured in 

15 0.7 X MMR containing 0.1% gelatin, 100 jig/ml BSA, 250 jig/ml Gentamicin 
(GIBCO BRL), and 200 pM purified recombinant activin (Ajinomoto, Inc.), or 
100 ng/ml human recombinant bFGF (GIBCO BRL), either until control 
embryos reached Stage 10.5 (for RT-PCR), or until control embryos reached 
Stage 23/24 (for animal cap elongation photography). Staging of embryos was 

20 done according to Nieuwkoop and Faber, supra. 

Total RNA was extracted from embryos and animal caps at Stage 
10.5, and RT-PCR was performed as previously described (LaBonne and 
Whitman, Development 120: 463-472, 1994), using 20 amplification cycles for 
EF-la and 25 forXbra. 
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Results 

Animals caps from embiyos expressing FAST-1 SID (Fig. 8A, lanes 
6-8, 10, 1 1) in the absence (Fig. 8A, lanes 1-8) or presence (Fig. 8A, lanes 
9-11) of overexpressed Smad2 were tested for the induction of the 
5 pan-mesodermal marker brachyury (Xbra). Fig. 8A shows an agarose gel 

containing electrophoretically resolved RT-PCR products from RNA extracted 
from animal caps cut at Stage 8-9, treated with activin or FGF, and harvested 
for RNA at Stage 10.5. The ubiquitously-expressed marker EFla was also RT- 
PCR-amplified within each reaction, as an internal control for quantitation of 

10 brachyury PCR products. The symbols "+F" and "-F" denote samples from 

embryos that were injected with Smad2 mRN A plus or minus mRNA encoding 
the FAST-1 SID (FAST-1 aa366-518). 

Activin-dependent induction of brachyury was inhibited by 
overexpressed FAST-1 SID. Moreover, inhibition was specific for the 

1 5 activin/TGF-P signaling pathway, since induction of brachyury by bFGF was 
unaffected by the FAST-1 SID (Fig. 8A). Activin-dependent induction of the 
endo-mesodermal marker Mix. 1 also was inhibited by the FAST- 1 SID, as was 
induction of mesodermal markers by Smad2. However, overexprcssion of 
Smad2 partially restored activin-dependent gene expression in the presence of 

20 the FAST-1 SID. These observations suggest that inhibition of mesoderm- 
specific gene expression is due to the sequestration of Smads by the FAST-1 
SID. 

Fig. 8B shows a series of photographs of animal caps from control 
and experimentally-manipulated Xenopus embryos. Panel A shows 
25 unstimulated embryos; Panel B shows activin-stimulated embryos; Panel C 

shows activin-stimulated/ FAST-1 SID-microinjected embryos; Panel D shows 
FGF-stimulated embryos; Panel E shows FGF-stimulated/ FAST-1 SID- 
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micromjected embryos; and Panel F shows unstimulated/ FAST-1 SID- 
microinjected embryos. 

Activin-dependent elongation of animal caps, a marker of mesoderm 
induction, was inhibited by overexpressed FAST-1 SID (Fig. 8B, panel C). In 
5 contrast, like bFGF-induced brachyury expression, bFGF-dependent animal cap 
elongation was not inhibited by overexpressed FAST-1 SID (Fig. 8B, panels C 
and E). These results indicate that FAST-1 SID specifically inhibits the 
activin/TGF-P signaling pathway. 

EXAMPLE X 

10 Specific binding of FAST- 1 apd Smad2 domains in vitro 

The limitations of the yeast interaction trap method for monitoring 
protein-protein interactions (e.g., the difficulty of delivering compounds into 
yeast, and other non-specific effects due to yeast biology), made it desirable to 
develop an alternative method without such limitations. Hence, we developed 

15 an in vitro method to detect interactions between the FAST-1 Smad Interaction 
Domain (SID) and Smad2 MH2 domain. This method allows the detection of 
inhibitors of activin/TGFP superfamily signaling. 

Methods 

The SID of FAST-1 was fused to a GST tag, expressed in E. coli % 
20 and isolated by binding to glutathione sepharose as described in an earlier 

section. As a control, GST was expressed and purified in parallel. Myc-tagged 
Smadl or Smad2 MH2 domains were expressed in Xenopus laevis embryos by 
mRNA injection; Stage 9 embryos were lysed and assayed for MH2 domain 
expression level by Western blot analysis using anti-Myc antibodies (lane 1, 
25 uninfected; lane 2, injected with Myc-Smadl MH2; lane 3, injected with Myc- 
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Smad2 MH2). Lysates from Stage 9 embryos expressing Myc epitope-tagged 
Smadl (lanes 4 and 6) or Smad2 MH2 (lanes 5 and 7) domains were also 
incubated with GST-control (lanes 4 and 5) or GST-FAST- 1 SID (lanes 6 and 
7) fusion protein. Protein complexes were precipitated by binding to 
5 glutathione-coated beads and precipitated proteins were resolved by Laemmli 
gel electrophoresis and subjected to Western blot analysis with anti-Myc 
antibodies. 

Results 

10 Fig. 9 shows a Western blot analysis, using anti-Myc antibodies, of 

samples from uninjected embryos (lane 1), embryos expressing Myc-Smadl 
MH2 domain (lanes 2, 4, 6,) and embryos expressing Myc~Smad2 MH2 
domain (lanes 3, 5, 7). Lanes 1, 2, and 3 are whole lysates, lanes 4 and 5 are 
immunoprecipitates from lysates incubated with GST control protein, and lanes 

15 6 and 7 are immunoprecipitates from lysates incubated with GST-FAST- 1 SID. 
Fig. 9 shows that the FAST-1 SID specifically binds the Smad2 M1I2 domain, 
but not the Smadl MH2 domain (lane 7). That fact that these results, 
originally detected using the yeast interaction trap assay described in Example 
VII, may also be observed in our in vitro assay, confirms the validity of this 

20 approach for monitoring specific FAST- 1/Smad2 interactions. 

EXAMPLE XI 

Identification of human and mouse homologues of FAST-1 

Since TGF-J3 superfamily signaling affects the development of a 
wide variety of organisms, we isolated cDNAs encoding mammalian 
25 homologues of Xenopus FAST- 1 . 

We searched publicly available sequence databases for sequences with identity 
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to the amino acid sequence of full length Xenopus FAST-1 (Chen et aL, supra), 
and for sequences with identity to amino acids 380 to 506 of FAST-1 
(corresponding to the FAST-1 SID). 

Our search for sequences with identity to foil length FAST-1 
5 identified no candidate FAST-1 homologues. However, by using the amino 
acid sequences corresponding to the FAST-1 SID as a probe to screen sequence 
listing databases, we identified one sequence in the TIGR Human Gene Index 
(TIGR clone ID No. 64997; clones in the TIGR index are commercially 
available) with identity to the FAST-1 SID. The Genbank accession number of 

10 the human FAST-1 SID partial sequence is AA2 18611. 

The human FASTI SID clone had an insert size of approximately 
300 base pairs, corresponding to a 100 amino acid long polypeptide. A cDNA 
clone encoding full-length human FAST-1 was isolated by screening a human 
cDNA library, by standard techniques, using the fragment encoding the human 

15 FAST-1 SID as a probe. 

Degenerate primers were designed that correspond to regions 
conserved between human and Xenopus FAST-1 . The primers were used in 
PCR reactions that contained cDNA from mouse embryonic stem cells as a 
template. A partial cDNA encoding mouse FAST- 1 was obtained, which was 

20 used to screen mouse cDNA and genomic libraries to obtain the full-length 
mouse FAST-1 sequence. 



EXAMPLE XII 

F AST-1 -like activity bv the human FAST-1 homologue 

We tested the candidate human homologue of FAST-1 for the ability 
25 to co-immunoprecipitate with Smad2. 
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Methods 

The 300 base pair insert encoding the human FAST-1 SID was 
tagged with the Myc epitope by subcloning the insert into the pCS2(+)MT 
vector. 

5 Myc-tagged human FAST-1 was co-expressed with GST-tagged 

Smad2 in Xenopus embryos in the presence or absence of activin stimulation. 
The microinjected embryos were then lysed and immunoprecipitated with 
anti-GST antibody, followed by blotting with anti-Myc antibody, as described 
in Example IV. 



10 Results 

The human Myc-tagged FAST-1 SID co-immunoprecipitates with 
GST-tagged Smad2, indicating that human FAST-1, like Xenopus FAST-1, is 
able to associate with Smad2. Also like Xenopus FAST-1, the human FAST-1 
SID co-immuno-precipitates Smad4 in an activin-dependent-manner. 
15 An amino acid sequence alignment of human, mouse, and Xenopus 

FAST-1 is shown in Fig. 10. Regions of FAST-1 polypeptides having identical 
amino acids are boxed. The human and mouse FAST-1 SIDs are comprised, at 
maximum, of human FAST-1 amino acids 234-365, and mouse FAST-1 amino 
acids 309-398. 



20 Other Embodiments; 

All publications and patent applications mentioned in this 
specification are herein incorporated by reference to the same extent as if each 
independent publication or patent application was specifically and individually 
indicated to be incorporated by reference. 

25 While the invention has been described in connection with specific 
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embodiments thereof, it will be understood that it is capable of further 
modifications and this application is intended to cover any variations, uses, or 
adaptations of the invention following, in general, the principles of the 
invention and including such departures from the present disclosure come 
5 within known or customary practice within the art to which the invention 
pertains and may be applied to the essential features hereinbefore set forth. 
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SSQUENCE LISTING 

(1) GENERAL INFORMATION 
(i) APPLICANT: THE PRESIDENT AND FELLOWS OF HARVARD COLLEGE 

(ii) TITLE OF THE INVENTION: METHODS AND REAGENTS FOR 
MODULATING TGF-3ETA SUPERFAMILY SIGNALLING 
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(iv) CORRESPONDENCE ADDRESS: 

<A) ADDRESSEE: Clark & Elbing LLP 
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(A) MEDIUM TYPE: Diskette 
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(viii) ATTORNEY/ AGENT INFORMATION: 
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(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1658 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 
(ix) FEATURE : 
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(A) NAME /KEY : Other 

(B) LOCATION: 1 ... 1 

(D) OTHER INFORMATION: Xenopus Smad2 coding region 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO : 1 : 

ATGTCGTCCA TCTTGCCTTT CACCCCGCCA GTAGTGAAGC GCCTGCTAGG ATGGAAGAAG 60 

TCTGCAAGTG GCACCACAGG AGCAGGTGGC GATGAGCAGA ACGGACAGGA AGAGAAGTGG 120 

TGCGAAAAAG CGGTAAAGAG CTTGGTGAAA AAACTGAAGA AAACGGGACA ATTAGACGAG 180 

CTTGAGAAGG CGATCACGAC GCAGAACTGC AACACGAAAT GCGTAACGAT ACCAAGCACT 240 

TGCTCTGAAA TTTGGGGACT GAGTACAGCA AATACCATAG ATCAGTGGGA TACCACAGGC 300 

CTTTACAGCT TCTCTGAACA AACCAGGTCT CTTGATGGTC GACTCCAGGT GTCTCACCGT 360 

AAAGGATTGC CGCATGTTAT CTACTGCAGA CTGTGGCGCT GGCCAGACCT GCACAGTCAT 4 20 

CATGAACTGA AAGCAATCGA AAATTGTGAA TATGCTTTTA ACCTTAAAAA AGATGAAGTT 4 80 

TGTGTCAATC CATACCATTA TCAGAGGGTG GAGACACCAG TTTTACCACC TGTATTAGTT 54 0 

CCACGGCACA CGGAAATCTT GACAGAGCTG CCACCTCTTG ATGACTACAC GCATTCCATT 600 

CCAGAAAACA CTAATTTTCC TGCAGGGATT GAACCTCAGA GCAATTATAT TCCAGAAACA 660 

CCACCTCCTG GATATATTAG TGAAGATGGA GAAACTAGCG ATCAGCAACT TAACCAAAGC 720 

ATGGACACAG GGTCACCAGC TGAGCTGTCT CCGAGTACAC TTTCTCCAGT CAACCACAAT 760 

CTCGATTTGC AACCTGTCAC CTATTCGGAA CCTGCTTTTT GGTGCTCTAT AGCATACTAC 84 0 

GAACTGAATC AGCGAGTAGG AGAAACTTTC CATGCATCGC AACCATCGCT TACCGTGGAC 900 

GGCTTTACGG ACCCCTCAAA CTCTGAAAGG TTCTGCTTAG GTTTACTCTC AAATGTGAAC 960 

CGAAATGCCA CGGTGGAAAT GACCAGGCGT CACATAGGAA GGGGTGTCCG GCTATATTAC 1020 

ATCGGTGGAG AGGTGTTTGC AGAGTGCCTA AGTGATAGTG CTATTTTTGT TCAGAGTCCA 10 SO 

AACTGTAACC AGCGATATGG ATGGCATCCA GCAACTGTAT GTAAGATTCC TCCAGGATGC 114 0 

AATCTGAAGA TTTTCAATAA TCAAGAGTTT GCGGCTCTCC TCGCTCAGTC TGTGAATCAA 1200 

GGCTTTGAAG CAGTTTATCA GTTAACTCGA ATGTGCACCA TAAGGATGAG CTTTGTAAAA 126 0 

GGCTGGGGTG CTGAATACAG GCGACAGACC GTTACAAGCA CTCCATGCTG GATTGAGCTT 13 20 

CACCTGAATG GACCTTTGCA GTGGTTGGAC AAAGTGTTGA CACAGATGGG ATCCCCTTCA 13 80 

GTCCGCTGCT CAAGCATGTC CTAATGGTCT CCTCTTTTTA ATGTATTACC TGCGGGCGGC 14 40 

AACTGCAGTC CCAGCAACAG ACTCAATACA GCTTGTCTGT CGTAGTATTT GTGTGTGGTG 1500 

CCCATGAACT GTTTACAATC CAAAAGAGAG AGAATAAAAA AG C AAAAAC A GCACTTGAGA 1560 

TCCCATCAAC GAAAAGCACC TTGTTGGATG ATGTTTCTGA TACTCTTAAA GTAGATCCGT 1620 

GTATAAATGA CTCCTTACCT GGGAAAAGGG ACTTTTTC 16 56 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 57 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE : protein 
(ix) FEATURE: 

(A) NAME/ KEY: Ozher 

(B) LOCATION: 1. . .1 

(D) OTHER INFORMATION: Xencpus Smad2 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Ser Ser He Leu Pro Phe Thr Pro Pre Val Val Lys Arg Leu Leu 

15 10 15 

Gly Trp Lys Lys Ser Ala Ser Gly Thr Thr Gly Ala Gly Gly Asp Glu 

20 25 30 

Gin Asn Gly Gin Glu Glu Lys Trp Cys Glu Lys Ala Val Lys Ser Leu 
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35 

Val Lys Lys Leu 
50 

He Thr Thr Gin 
65 

Cys Ser Glu lie 

Asp Thr Thr Gly 
100 

Gly Arg Leu Gin 
115 

Cys Arg Leu Trp 
130 

Ala He Glu Asn 
145 

Cys Val Asn Pro 

Pro Val Leu Val 
190 

Leu Asp Asp Tyr 
195 

Gly He Giu Pro 
210 

Tyr He Ser Glu 
225 

Met Asp Thr Gly 

Val Asn His Asn 
260 

Phe Trp Cys Ser 
275 

Thr Phe His Ala 
2S0 

Pro Ser Asn Ser 
305 

Arc Asn Ala Thr 

Arc Leu Tyr Tyr 
340 

Ser Ala lie Phe 
355 

His Pro Ala Thr 
370 

Phe Asn Asn Gin 
385 

Gly Phe Glu Ala 

Ser Phe Val Lys 
420 

Ser Thr Pro Cys 
435 

Leu Asp Lys Val 
450 

Ser Met Ser 
465 



40 

Lys Lys Thr Gly 
55 

Asn Cys Asn Thr 
70 

Trp Gly Leu Ser 
85 

Leu Tyr Ser Phe 

Val Ser His Arg 
120 

Arg Trp Pro Asp 
135 

Cys Glu Tyr Ala 
150 

Tyr His Tyr Gin 
165 

Pro Arg His Thr 

Thr His Ser He 
200 

Gin Ser Asn Tyr 
215 

Asp Gly Glu Thr 
230 

Ser Pro Ala Glu 
245 

Leu Asp Leu Gin 

He Ala Tyr Tyr 
280 

Ser Gin Pro Ser 
295 

Glu Arg Phe Cys 
310 

Val Glu Met Thr 
325 

He Gly Gly Glu 

Val Gin Ser Pro 
360 

Val Cys Lys He 
375 

Glu Phe Ala Ala 
390 

Val Tyr Gin Leu 
405 

Gly Trp Gly Ala 

Trp He Glu Leu 
440 

Leu Thr Gin Met 
455 



-53- 



Gln Leu Asp Glu 
60 

Lys Cys Val Thr 
75 

Thr Ala Asn Thr 
90 

Ser Glu Gin Thr 
105 

Lys Gly Leu Pro 

Leu His Ser His 
140 

Phe Asn Leu Lys 
155 

Arg Val Glu Thr 
170 

Glu He Leu Thr 
185 

Pro Glu Asn Thr 

He Pro Glu Thr 
220 

Ser Asp Gin Gin 
235 

Leu Ser Pro Ser 
250 

Pro Val Thr Tyr 

2 65 

Glu Leu Asn Gin 

Leu Thr Vai Asp 
300 

Leu Gly Leu Leu 
315 

Arg Arg His He 
330 

Val Phe Ala Glu 
345 

Asn Cys Asn Gin 

Pro Pro Gly Cys 
380 

Leu Leu Ala Gin 

395 

Thr Arg Met Cys 
4 10 

Glu Tyr Arg Arg 
425 

His Leu Asn Gly 

Gly Ser Pro Ser 
460 



45 

Leu Glu Lys Ala 

He Pro Ser Thr 
80 

He Asp Gin Trp 
95 

Arg Ser Leu Asp 
110 

His Val He Tyr 
125 

His Glu Leu Lys 

Lys Asp Glu Val 
160 

Pro Vai Leu Pro 
175 

Glu Leu Pro Pro 
190 

Asn Phe Pro Ala 
205 

Pro Pro Pro Gly 

Leu Asn Gin Ser 
240 

Thr Leu Ser Pro 
255 

Ser Glu Pro Ala 

270 

Arg Val Gly Glu 
285 

Gly Phe Thr Asp 

Ser Asn Val Asn 

320 

Gly Arg Gly Val 

335 

Cys Leu Ser Asp 
350 

Arg Tyr Gly Trp 
365 

Asn Leu ^ys lie 

Ser Val Asn Gin 
400 

Thr He Arg Met 
41b 

Gin Thr Val Thr 
4 3 0 

Pro Leu Gin Trp 
445 

Val Arc Cys Ser 



(2) INFORMATION FOR SEQ ID NO: 3: 



(i) SEQUENCE CHARACTERISTICS: 
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{ A) LENGTH; 194 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 
(ix) FEATURE: 

(A) NAME /KEY : Other 

(B) LOCATION: 1...1 

(D) OTHER INFORMATION: Xenopus Smad2 MH2 domain 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 



Trp 


Cys 


Ser 


He 


Ala 


Tyr 


Tyr 


Glu 


Leu Asn Gin Arg Val 


Gly 


Glu 


Thr 


1 








5 








10 




15 




Phe 


His 


Ala 


Ser 


Gin 


Pro 


Ser 


Leu 


Thr Val Asp Gly Phe 


Thr 


Asp 


Pro 








20 










25 


30 






Ser 


Asn 


Ser 


Glu 


Arg 


Phe 


Cys 


Leu 


Gly Leu Leu Ser Asn 


Val 


Asn 


Arg 






35 










40 


45 








Asn 


Ala 


Thr 


Val 


Glu 


Met 


Thr 


Arg 


Arg His He Gly Arg 


Gly 


Val 


Arg 




50 










55 




60 








Leu 


Tyr 


Tyr 


He 


Gly 


Gly 


Glu 


Val 


Phe Ala Glu Cys Leu 


Ser 


Asp 


Ser 


65 










70 






75 






B0 


Ala 


He 


Phe 


Val 


Gin 


Ser 


Pro 


Asn 


Cys Asn Gin Arg Tyr 


Gly 


Trp 


His 










85 








90 




95 




Pro 


Ala 


Thr 


Val 


Cys 


Lys 


He 


Pro 


Pro Gly Cys Asn Leu 


Lys 


He 


Phe 








100 










105 


110 






Asn 


Asn 


Gin 


Glu 


Phe 


Ala 


Ala 


Leu 


Leu Ala Gin Ser Val 


Asn 


Gin Gly 






115 










12 0 


125 








Phe 


Glu 


Ala 


Val 


Tyr 


Gin 


Leu 


Thr 


Arg Met Cys Thr He 


Arg 


Met 


Ser 




13C 










135 




140 








Phe 


Val 


Lys 


Gly 


Trp 


Gly 


Ala 


Glu 


Tyr Arg Arg Gin Thr 


Val 


Thr 


Ser 


145 










150 






155 






160 


Thr 


Pro 


Cys 


Trp 


He 


Glu 


Leu 


His 


Leu Asn Gly Pro Leu 


Gin 


Trp 


Leu 










165 








170 




17 5 




Asp 


Lys 


Val 


Leu 


Thr 


Gin 


Met 


Gly 


Ser Pre Ser Val Arg 


Cys 


Ser 


Ser 



180 185 190 

Me: Ser 



(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1401 base pairs 
(E) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A 1 , NAME /KEY : Other 
(B; LOCATION: 1...1 

(d; OTHER INFORMATION: Human Smad2 ceding region 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 4 : 
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ATGTCGTCCA TCTTGCCATT CACGCCGCCA GTTGTGAAGA GACTGCTGGG ATGGAAGAAG 6 0 

TCAGCTGGTG GGTCTGGAGG AGCAGGCGGA GGAGAGCAGA ATGGGCAGGA AGAAAAGTGG 12 0 

TGTGAGAAAG CAGTGAAAAG TCTGGTGAAG AAGCTAAAGA AAACAGGACG ATTAGATGAG 18 0 

CTTGAGAAAG CCATCACCAC TCAAAACTGT AATACTAAAT GTGTTACCAT ACCAAGCACT 24 0 

TGCTCTGAAA TTTGGGGACT GAGTACACCA AATACGATAG ATCAGTGGGA TACAACAGGC 300 

CTTTACAGCT TCTCTGAACA AACCAGGTCT CTTGATGGTC GTCTCCAGGT ATCCCATCGA 360 

AAAGGATTGC CACATGTTAT ATATTGCCGA TTATGGCGCT GGCCTGATCT TCACAGTCAT 42 0 

CATGAACTCA AGGCAATTGA AAACTGCGAA TATGCTTTTA ATCTTAAAAA GGATGAAGTA 480 

TGTGTAAACC CTTACCACTA TCAGAGAGTT GAGACACCAG TTTTGCCTCC AGTATTAGTG 54 0 

CCCCGACACA CCGAGATCCT AACAGAACTT CCGCCTCTGG AT G ACT AT AC TCACTCCATT 600 

CCAGAAAACA CTAACTTCCC AGCAGGAATT GAGCCACAGA GTAATTATAT TCCAGAAACG 66 0 

CCACCTCCTG GATATATCAG TGAAGATGGA GAAACAAGTG ACCAACAGTT GAATCAAAGT 72 0 

ATGGACACAG GCTCTCCAGC AGAACTATCT CCTACTACTC TTTCCCCTGT TAATCATAGC 780 

TTGGATTTAC AGCCAGTTAC TTACTCAGAA CCTGCATTTT GGTGTTCAAT AGCATATTAT 84 0 

GAATTAAATC AGAGGGTTGG AGAAACCTTC CATGCATCAC AGCCCTCACT CACTGTAGAT 900 

GGCTTTACAG AC C CATC AAA TTCAGAGAGG TTCTGCTTAG GTTTACTCTC CAATGTTAAC 96 0 

CGAAATGCCA CGGTAGAAAT GACAAGAAGG CATATAGGAA GAGGAGTGCG CTTATACTAC 102 0 

ATAGGTGGGG AAGTTTTTGC TGAGTGCCTA AGTGATAGTG CAATCTTTGT GCAGAGCCCC 1080 

AATTGTAATC AGAGATATGG CTGGCACCCT GCAACAGTGT GTAAAATTCC ACCAGGCTGT 114 0 

AAT CTGAAGA TCTTCAACAA CCAGGAATTT GCTGCTCTTC TGGCTCAGTC TGTTAATCAG 1200 

GGTTTTGAAG CCGTCTATCA GCTAACTAGA ATGTGCACCA TAAGAATGAG TTTTGTGAAA 126 0 

GGGTGGGGAG CAGAATACCG AAGGCAGACG GTAACAAGTA CTCCTTGCTG GATTGAACTT 132 0 

CATCTGAATG GACCTCTACA GTGGTTGGAC AAAGTATTAA CTCAGATGGG ATCCCCTTCA 1380 

GTGCGTTGCT CAAGCATGTC A 1401 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 467 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: protein 
(ix) FEATURE: 

(A) NAME /KEY : Other 

(B) LOCATION: 1. . .1 

(D) OTHER INFORMATION: Human Smad2 



(xi) SEQUENCE 

Met Ser Ser lie Leu 

1 5 
Gly Trp Lys Lys Ser 
20 

Gin Asn Gly Gin Glu 
35 

Val Lys Lys Leu Lys 
50 

lie Thr Thr Gin Asn 
€5 

Cys Ser Glu lie Trp 
85 

Asp Thr Thr Gly Leu 
100 

Gly Arg Leu Gin Val 
115 



DESCRIPTION: SEC TD 

Pro ?he Thr Pro Pro 
10 

Ala Gly Gly Ser Gly 
25 

Glu Lys Trp Cys Glu 
40 

Lys Thr Gly Arg Leu 
55 

Cys Asn Thr Lys Cys 
70 

Gly Leu Ser Thr Pro 
90 

Tyr Ser Phe Ser Glu 
105 

Ser His Arg Lys Gly 
120 



NO : 5 : 

Val Val Lys Arg Leu Leu 
15 

Gly Ala Gly Gly Gly Glu 

30 

Lys Ala Val Lys Ser Leu 
45 

Asp Glu Leu Glu Lys Ala 
50 

Val Thr He Pro Ser Thr 
75 30 
Asn Thr He Asp Gin Trp 
95 

Glr. Thr Arg Ser Leu Asp 
110 

Leu Pro His Val He Tyr 
125 
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Cys Arg 


Leu 


Trp 


Arg 


Trp 


Pro 


Asp 


Leu 


His Ser 


His 


His 


Glu 


Leu Lys 


130 










135 








140 








Ala lie 


Glu 


Asn 


Cys 


Glu 


Tyr 


Ala 


Phe 


Asn Leu 


Lys 


Lys 


Asp Glu Val 


145 








150 








155 








160 


Cys Val 


Asn 


Pro 


Tyr 


His 


Tyr 


Gin 


Arg Val Glu 


Thr 


Pro 


Val 


Leu Pro 








165 










170 








175 


Pro Val 


Leu 


Val 


Pro 


Arg 


His 


Thr 


Glu 


He Leu 


Thr 


Glu 


Leu 


Pro Pro 






180 










185 








190 




Leu Asp 


Asp 


Tyr 


Thr 


His 


Ser 


He 


Pro 


Glu Asn 


Thr 


Asn 


Phe 


Pro Ala 




195 










200 








205 






Gly lie 


Glu 


Pro 


Gin 


Ser 


Asn 


Tyr 


He 


Pro Glu 


Thr 


Pro 


Pro Pro Gly 


210 










215 








220 








Tyr He 


Ser 


Glu 


Asp 


Gly 


Glu 


Thr 


Ser 


Asp Gin 


Gin 


Leu 


Asn 


Gin Ser 


225 








230 








235 








240 


Met Asp 


Thr 


Gly 


Ser 


Pro 


Ala 


Glu 


Leu 


Ser Pro 


Thr 


Thr 


Leu 


Ser Pro 








245 










250 








255 


Val Asn 


His 


Ser 


Leu 


Asp 


Leu 


Gin 


Pro 


Val Thr 


Tyr 


Ser 


Glu 


Pro Ala 






260 










265 








270 




Phe Trp 


Cys 


Ser 


He 


Ala 


Tyr 


Tyr 


Glu 


Leu Asn 


Gin 


Arg 


Val 


Gly Glu 




275 










280 








285 






Thr Phe 


His 


Ala 


Ser 


Gin 


Pro 


Ser 


Leu 


Thr Val 


Asp Gly 


Phe 


Thr Asp 


290 










295 








300 








Pro Ser 


Asn 


Ser 


Glu 


Arg 


Phe 


Cys 


Leu Gly Leu Leu 


Ser 


Asn 


Val Asn 


305 








310 








315 








320 


Arg Asn 


Ala 


Thr 


Val 


Glu 


Met 


Thr 


Arg Arg His 


He 


Gly 


Arg 


Gly Val 








325 










330 








335 


Arg Leu 


Tyr 


Tyr 


He 


Gly 


Gly 


Glu 


Val 


Phe Ala 


Glu 


Cys 


Leu 


Ser Asp 






340 










345 








350 




Ser Ala 


He 


Phe 


Val 


Gin 


Ser 


Pro 


Asn 


Cys Asn Gin Arg 


Tyr 


Gly Trp 




355 










360 








365 






His Pro 


Ala 


Thr 


Val 


Cys 


Lys 


He 


Pro 


Pro Gly 


Cys 


Asn 


Leu 


Lys He 


370 










375 








380 








Phe Asn 


Asn 


Gin 


Glu 


Phe 


Ala 


Ala 


Leu 


Leu Ala 


Gin 


Ser 


Val 


Asn Gin 


385 








390 








395 








400 


Gly Phe 


Glu 


Ala 


Val 


Tyr 


Gin 


Leu 


Thr 


Arg Met 


Cys 


Thr 


He 


Arg Met 








405 










410 








415 


Ser Phe 


Val 


Lys 


Gly 


Trp 


Gly 


Ala 


Glu 


Tyr Arg 


Arg 


Gin 


Thr 


Val Thr 






420 










425 








430 




Ser Thr 


Pro 


Cys 


Trp 


He 


Glu 


Leu 


His 


Leu Asn 


Gly 


Pro 


Leu 


Gin Trp 




435 










440 








445 






Leu Asp 


Lys 


Val 


Leu 


Thr 


Gin 


Met 


Gly 


Ser Pro 


Ser 


Val 


Arg 


Cys Ser 



450 455 460 



Ser Met Ser 
465 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 194 amino acids 

(B) TYPE: amino acid 

{C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

£ii) MOLECULE TYPE : protein 
(ix) FEATURE : 

(A) NAME/ KEY : Other 

(B) LOCATION: 1...1 

(D) OTHER INFORMATION: Human Smad2 MH2 domain 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 



Trp Cys 


Ser 


lie 


Ala Tyr 


Tyr 


Glu 


Leu 


Asn Gin Arg Val 


Gly Glu Thr 


1 








5 








10 


15 


Phe 


His 


Ala 


Ser 


Gin Pro 


Ser 


Leu 


Thr 


Val Asp Gly Phe 


Thr Asp Pro 








20 








25 




30 


Ser 


Asn 


Ser 


Glu 


Arg Phe 


Cys 


Leu 


Gly 


Leu Leu Ser Asn 


Val Asn Arg 






35 








40 




45 




Asn 


Ala 


Thr 


Val 


Glu Met 


Thr 


Arg 


Arg 


His He Gly Arg 


Gly Val Arg 




50 








55 






60 




Leu Tyr Tyr 


lie 


Gly Gly 


Glu 


val 


Phe 


Ala Glu Cys Leu 


Ser Asp Ser 


65 








70 








75 


80 


Ala 


lie 


Phe 


Val 


Gin Ser 


Pro 


Asn 


Cys 


Asn Gin Arg Tyr 


Gly Trp His 










85 








90 


95 


Pro 


Ala 


Thr 


Val 


Cys Lys 


He 


Pro 


Pro 


Gly Cys Asn Leu 


Lys He Phe 








100 








105 




110 


Asn 


Asn 


Gin 


Glu 


Phe Ala 


Ala 


Leu 


Leu 


Ala Gin Ser Val 


Asn Gin Gly 






115 








120 




125 




Phe 


Glu 


Ala 


Val 


Tyr Gin 


Leu 


Thr 


Arg 


Met Cys Thr He 


Arg Net Ser 




130 








135 






140 




Phe 


Val 


Lys 


Gly 


Trp Gly 


Ala 


Glu 


Tyr 


Arg Arg Gin Thr 


Val Thr Ser 


145 








150 








155 


160 


Thr 


Pro 


Cys 


Trp 


lie Glu 


Leu 


His 


Leu 


Asn Gly Pro Leu 


Gin Trp Leu 










165 








170 


175 


Asp 


Lys 


Val 


Leu 


Thr Gin 


Met 


Gly 


Ser 


Pro Ser Val Arg 


Cys Ser Ser 








130 








185 




190 



Met Ser 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH : 2234 base pairs 
<B) TYPE: nucleic acid 

(C) STRAKDEDNESS: single 
tD> TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 
(ix) FEATURE: 

(A) NAME/ KEY : Coding Sequence 

(3) LOCATION: 1. . .1272 
(D) OTHER INFORMATION: 



(A) NAME /KEY : Other 
(3) LOCATION: 1 . . .3 

ID) OTHER INFORMATION: Human Smad3 coding region 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

ATG TCG TCC ATC CTG CCT TTC ACT CCC CCG ATC GTG AAG CGC CTG CTG 4 8 

Met Ser Ser lie Leu Pro Phe Thr Pro Pro lie Val Lys Arg Leu Leu 
15 10 15 



GGC TGG AAG AAG GGC GAG CAG AAC 



GGG CAG GAG GAG AAA. TGG TGC GAG 



96 
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Gly Trp Lys Lys Gly Glu Gin Asn Gly Gin Glu Glu Lys Trp Cys Glu 
20 25 30 

AAG GCG GTC AAG AGC CTG GTC AAG AAA CTC AAG AAG ACG GGG CAG CTG 144 
Lys Ala Val Lys Ser Leu Val Lys Lys Leu Lys Lys Thr Gly Gin Leu 
35 40 45 

GAC GAG CTG GAG AAG GCC ATC ACC ACG CAG AAC GTC AAC ACC AAG TGC . 192 
Asp Glu Leu Glu Lys Ala lie Thr Thr Gin Asn Val Asn Thr Lys Cys 
50 55 60 

ATC ACC ATC CCC AGG TCC CTG GAT GGC CGG TTG CAG GTG TCC CAT CGG 24 0 

He Thr He Pro Arg Ser Leu Asp Gly Arg Leu Gin Val Ser His Arg 
65 70 75 80 

AAG GGG CTC CCT CAT GTC ATC TAC TGC CCT GTG CGA TGG CCA GAC CTG 288 
Lys Gly Leu Pro His Val lie Tyr Cys Pro Val Arg Trp Pro Asp Leu 
85 90 95 

CAC AGC CAC CAC GAG CTG CGG GCC ATG GAG CTG TGT GAG TTC GCC TTC 336 
His Ser His His Glu Leu Arg Ala Met Glu Leu Cys Glu Phe Ala Phe 
100 105 110 

AAT ATG AAG AAG GAC GAG GTC TGC GTG AAT CCC TAC CAC TAC CAG AGA 3R4 
Asn Met Lys Lys Asp Glu Val Cys Val Asn Pro Tyr His Tyr Gin Arg 
115 120 125 

GTA GAG ACA CCA GTT CTA CCT CCT GTG TTG GTG CCA CGC CAC ACA GAG 4 32 

Val Glu Thr Pro Val Leu Pro Pro Val Leu Val Pro Arg His Thr Glu 
130 135 140 

ATC CCG GCC GAG TTC CCC CCA CTG GAC GAC TAC AGC CAT TCC ATC CCC 4 80 

He Pro Ala Glu Phe Pro Pro Leu Asp Asp Tyr Ser His Ser He Pro 
145 15C 155 160 

GAA AAC ACT AAC TTC CCC GCA GGC ATC GAG CCC CAG AGC AAT ATT CCA 52 3 

Glu Asn Thr Asn Phe Pro Ala Gly He Glu Pro Gin Ser Asn He Pro 
165 170 175 

GAG ACC CCA CCC CCT GGC TAC CTG AGT GAA GAT GGA GAA ACC AGT GAC 5 76 

Glu Thr Pro Pro Pro Gly Tyr Leu Ser Glu Asp Gly Glu Thr Ser Asp 
180 185 190 

CAC CAG ATG AAC CAC AGC ATG GAC GCA GGT TCT CCA AAC CTA TCC CCG 6 24 

His Gin Met Asn His Sor Met Asp Ala Gly Ser Pro Asn Leu Ser Pro 
195 200 205 

AAT CCG ATG TCC CCA GCA CAT AAT AAC TTG GAC CTG CAG CCA GTT ACC 6 72 

Asn Pro Met Ser Pro Ala Has Asn Asn Leu Asp Leu Gin Pro Val Thr 
210 215 220 

TAC TGC GAG CCG GCC TTC TGG TGC TCC ATC TCC TAC TAC GAG CTG AAC 72 0 

Tyr Cys Glu Pro Ala Phe Trp Cys Ser He Ser Tyr Tyr Glu Leu Asn 
225 230 235 240 

CAG CGC GTC GGG GAG ACA TTC CAC GCC TCG CAG CCA TCC ATG ACT GTG 7 63 

Gin Arg Val Gly Glu Thr Phe His Ala Ser Gin Pro Ser Met Tnr Val 
245 250 255 
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GA7 GGC TTC ACC GAC CCC TCC AAT TCG GAG CGC TTC TGC CTA GGG CTG 816 
Asp Gly Phe Thr Asp Pro Ser Asn Ser Glu Arg Phe Cys Leu Gly Leu 
260 265 270 

CTC TCC AAT GTC AAC AGG AAT GCA GCA GTG GAG CTG ACA CGG AGA CAC 864 
Leu Ser Asn Val Asn Arg Asn Ala Ala Val Glu Leu Thr Arg Arg His 
275 280 285 

ATC GGA AGA GGC GTG CGG CTC TAC TAC ATC GGA GGG GAG GTC TTC GCA 912 
lie Gly Arg Gly Val Arg Leu Tyr Tyr lie Gly Gly Glu Val Phe Ala 
290 295 300 

GAG TGC CTC AGT GAC AGC GCT ATT TTT GTC CAG TCT CCC AAC TGT AAC 96 0 

Glu Cys Leu Ser Asp Ser Ala He Phe Val Gin Ser Pro Asn Cys Asn 
305 310 315 320 

CAG CGC TAT GGC TGG CAC CCG GCC ACC GTC TGC AAG ATC CCA CCA GGA 10 08 
Gin Arg Tyr Gly Trp Kis Pro Ala Thr Val Cys Lys He Pro Pro Gly 
325 330 335 

TGC AAC CTG AAG ATC TTC AAC AAC CAG GAG TTC GCT GCC CTC CTG GCC 105 6 
Cys Asn Leu Lys He Phe Asn Asn Gin Glu Phe Ala Ala Leu Leu Ala 
340 345 350 

CAG TCG GTC AAC CAG GGC TTT GAG GCT GTC TAC CAG TTG ACC CGA ATG 1104 
Gin Ser Val Asn Gin Gly Phe Glu Ala Val Tyr Gin Leu Thr Arg Met 
355 360 ^ 365 

TGC ACC ATC CGC ATG AGC TTC GTC AAA GGC TGG GGA GCG GAG TAC AGG 7:152 
Cys Thr He Arg Met Ser Phe Val Lys Gly Trp Gly Ala Glu Tyr Arg 
370 375 380 

AGA CAG ACT GTG ACC AGT ACC CCC TGC TGG ATT GAG CTG CAC CTG AAT 12C0 
Arg Gin Thr Val Thr Ser Thr Pro Cys Trp He Glu Leu His Leu Asn 
385 390 395 400 

GGG CCT TTG CAG TGG CTT GAC AAG GTC CTC ACC CAG ATG GGC TCC CCA 1248 
Gly Pro Leu Gin Trp Leu Asp Lys Val Leu Thr Gin Met Gly Ser Pro 
405 410 415 

AGC ATC CGC TGT TCC AGT GTG TCT TAG AG A CATC AAGTATGGTA GGGGAGGGCA 13 02 
Ser He Arg Cys Ser Ser Val Ser 
42C 

GGCTTGGGGA AAATGGCCAT AOVGGAGGTG 3AGAAAATTG GAACTCTACT CAACCCATTG 13 62 

TTGTCAAGGA AGAAGAAATC TTTCTCCCTC AACTGAAGGG GTGCACCCAC CTGTTTTCTG 14 2 2 

AAACACACGA GCAAACCCAG AGGTGGATGT TATGAACAGC TGTGTCTGCC A*\ACACATTT 1482 

ACCCTTTGGC CCCACTTTGA AGGGCAAGAA ATGGCGTCTG CTCTGGTGGC TTAAGTGAGC 154 2 

AGAACAGGTA GTATTACACC ACCGGCACCC TCCCCCCAGA CTCTTTTTTT GAG T GAC AGC 16 02 

TTTCTGGGAT GTCACAGTCC AACCAGAAAC GCCCCTCTGT CTAGGACTGC AGTGTGGAGT 166 2 

TCACCTTGGA AGGGCGTTCT AGGTAGGAAG AGCCCGCACG ATGCAGACCT CATGCCCAGC 172 2 

TCTCTGACGC TTGTGACAGT GCCTCT7CCA GTGAACATTC CCAGCCCAGC CCCGCCCCGT 1762 

TGTGAGCTGG ATAGACTTGG GATGGGGAGG GAGGGAGTTT TGTCTGTCTC CCTCCCCTCT 184 2 

CAGAACATAC TGATTGGGAG GTGCGTGTTC AGCAGAACCT GCACACAGGA CAGCGGGAAA 1902 

AATCGATGAG CGCCACCTCT TTAAAAACTC ACTTACGTTG TCCTTTTTCA CTTTGAAAAG 1962 

TTGGAAGGAC TGCTGAGGCC CAGTGCATAT GCAATGTATA GTGTCTATTA T CAC ATT AAT 2022 

CTCAAAGAGA TTCGAATGAC GGTAAGTGTT CTC ATG AAG C AGGAGGCCCT TGTCGTGGGA 2082 

TGGCATTTGG TCTCAGGCAG CACCACACTG GGTGCGTCTC CAGTCATCTG TAAGAGCTTG 214 2 

CTCCAGATTC TG ATG CAT AC GGCTATATTG GTTTATGTAG TCAGTTGCAT TCATTAAATC 2 202 
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AACTTTATCA TATGCTCAAA AAAAAAAAAA AG 223 4 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 
(A} LENGTH: 424 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(ix) FEATURE: 

(A) NAME /KEY : Other 

(B) LOCATION: 1. . . 1 

(D) OTHER INFORMATION: Human Smad3 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Met Ser Ser lie Leu Pro Phe Thr Pro Pro lie Val Lys Arg Leu Leu 

15 10 15 

Gly Trp Lys Lys Gly Glu Gin Asn Gly Gin Glu Glu Lys Trp Cys Glu 

20 25 30 

Lys Ala Val Lys Ser Leu Val Lys Lys Leu Lys Lys Thr Gly Gin Leu 

35 40 45 

Asp Glu Leu Glu Lys Ala lie Thr Thr Gin Asn Val Asn Thr Lys Cys 

50 55 60 

lie Thr He Pro Arg Ser Leu Asp Gly Arg Leu Gin Val Ser His Arg 
65 70 75 80 

Lys Gly Leu Pro His Val He Tyr cys Pro Val Arg Trp Pre Asp Leu 

85 90 95 

His Ser His His Glu Leu Arg Ala Met Glu Leu Cys Glu Phe Ala Phe 

100 105 110 

Asn Met Lys Lys Asp Glu Val Cys Val Asn Pro Tyr Kis Tyr Gin Arg 

115 120 125 

Val Glu Thr Pro Val Leu Pro Pro Val Leu Val Pro Arg His Thr Glu 

130 135 140 

He Pro Ala Glu Phe Pro Pro Leu Asp Asp Tyr Ser His Ser He Pro 
145 ISC 155 160 

Glu Asn Thr Asn Phe Pro Ala Gly He Glu Pro Gin Ser Asn He Pro 

165 170 175 

Glu Thr Pro Fro Pro Gly Tyr Leu Ser Glu Asp Gly Glu Thr Ser Asp 

18 0 .18 5 190 

His Gin Met Asn His Scr Met Asp Ala Gly Ser Pro Asn Leu Sei Pro 

155 200 205 

Asn Pro Met Ser Pro Ala His Asn Asn Leu Asp Leu Gin Pro Val Thr 

21C 215 220 

Tyr Cys Glu Pro Ala Phe Trp Cys Ser He Ser Tyr Tyr Glu Leu Asn 
225 230. 235 240 

Gin Arc Val Gly Glu Tnr Phe His Ala Ser Gin Pro Ser Met Thr Val 

245 250 255 

Asp Gly Phe Thr Asp Pro Ser Asn Ser Glu Arg Phe Cys Leu Gly Leu 

260 265 270 

Leu Ser Asn Val Asn Arg Asn Ala Ala Val Glu Leu Thr Arg Arg His 

275 280 285 

He Gly Arg Gly Val Arg Leu Tyr Tyr He Gly Gly Glu Val Phe Ala 

290 295 300 

Glu Cys Leu Ser Asp Ser Ala He Phe Val Gin Ser Pro Asn Cys Asn 
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305 








310 






315 






320 


Gin Arg 


Tyr 


Gly 


Trp 


His 


Pro 


Ala 


Thr Val Cys 


Lys 


He 


Pro Pro Gly 








325 








330 






335 


Cys Asn 


Leu 


Lys 


He 


Phe 


Asn 


Asn 


Gin Glu Phe 


Ala 


Ala 


Leu Leu Ala 






340 










345 






350 


Gin Ser 


Val 


Asn 


Gin Gly 


Phe 


Glu 


Ala Val Tyr 


Gin 


Leu 


Thr Arg Met 




355 










360 






365 


Cys Thr 


He 


Arg 


Met 


Ser 


Phe 


Val 


Lys Gly Trp 


Gly Ala Glu Tyr Arg 


370 










375 






380 






Arg Gin 


Thr 


Val 


Thr 


Ser 


Thr 


Pro 


Cys Trp He 


Glu 


Leu 


Kis Leu Asn 


385 








390 






395 






400 


Gly Pro Leu 


Gin 


Trp 


Leu 


Asp 


Lys 


Val Leu Thr 


Gin 


Met 


Gly Ser Pro 








405 








410 






415 


Ser He 


Arg 


Cys 


Ser 


Ser 


Val 


Ser 











420 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 194 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(ix) FEATURE: 

(A) NAME /KEY : Other 

(B) LOCATION: 1. . .1 

<D) OTHER INFORMATION: Human Smad3 MH2 domain 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 9 : 



Trp 


Cys 


Ser 


He 


Ser 


Tyr 


Tyr 


Glu 


Leu 


Asn 


Gin 


Arg 


Val 


Gly 


Glu Thr 


1 








5 










1C 










15 


Phe 


His 


Ala 


Ser 


Gin 


Pro 


Ser 


Met 


Thr 


Val 


Asp 


Gly 


Phe 


Thr 


Asp Pro 








20 










25 










30 


Ser 


Asn 


Ser 


Glu 


Arg 


Phe 


Cys 


Leu 


Gly 


Leu 


Leu 


Ser 


Asn 


Val 


Asn Arg 






35 










40 










45 




Asn 


Ala 


Ala 


Val 


Glu 


Leu 


Thr 


Arg 


Arg 


His 


He 


Gly 


Arc 


Gly 


Val Arg 




50 










55 










60 






Leu 


Tyr 


Tyr 


He 


Gly Gly 


Glu 


Val 


Phe 


Ala 


Glu 


Cys 


Leu 


Ser Asp Ser 


65 










70 










75 








80 


Ala 


He 


Phe 


Val 


Gin 
35 


Ser 


Pro 


Asn 


Cys 


Asn 
30 


Gin 


Arg 


Tyr 


Gly 


Trp His 
95 


Pro 


Ala 


Thr 


Val 
100 


Cys 


Lys 


He 


Pre 


Pro 
105 


Gly 


Cys 


Asn 


Leu 


Lys 
110 


lie Phe 


Asn 


Asn 


Gin 


Glu 


Phe 


Ala 


Ala 


Leu 


Leu 


Ala 


Gin 


Ser 


Val 


Asn 


Gin Gly 


Phe 




115 










120 










125 




Glu 


Ala 


Val 


Tyr 


Gin 


Leu 


Thr 


Arc 


Met 


Cys 


Thr 


He 


Arg 


Mel Ser 


Phe 


130 










135 










140 






Val 


Lys 


Gly 


Trp 


Gly 


Ala 


Glu 


Tyr 


Arg 


Arg 


Gin 


Thr 


Val 


Thr Ser 


145 










150 










155 








160 


Thr 


Pro 


Cys 


Trp 


He 
165 


Glu 


Leu 


His 


Leu 


Asn 
170 


Gly 


Pro 


Leu 


Gin 


Trp Leu 
175 


Asp 


Lys 


Val 


Leu 


Thr 


Gin 


Met 


Gly 


Ser 


Pro 


Ser 


He 


Arg 


Cys 


Ser Ser 








180 










135 








190 





Val Ser 
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(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1605 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...1554 
(D) OTHER INFORMATION: 



(A) NAME/ KEY : Other 

(B) LOCATION: 1...1 

(D) OTHER INFORMATION: Xenopus FAST-1 coding region 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

ATG AGA GAC CCC TCC AGT CTG TAC TCA GGA TTC CCA GCT GGA TCC CAG 4 8 

Met Arg Asp Pro Ser Ser Leu Tyr Ser Gly Phe Pro Ala Gly Ser Gin 
15 io 15 

TAT GAA AGT GTG GAG CCT CCC AGC CTT GCC CTG CTG AGC TCT ATA GAC 9 6 

Tyr Glu Ser Val Glu Pre Pro Ser Leu Ala Leu Leu Ser Ser He Asp 
20 25 30 

CAG GAG CAG CTC CCA GTG GCC ACC GGC CAG TCC TAT AAT CAC AGT GTC 144 
Gin Glu Gin Leu Pro Val Ala Thr Gly Gin Ser Tyr Asn His Ser Val 
35 40 45 

CAG CCT TGG CCC CAA CCT TGG CCA CCC TTG TCC CTG TAC AGA GAG GGG 192 
Gin Pre Trp Pro Gin Pro Trp Pro Pro Leu Ser Leu Tyr Arg Glu Gly 
50 5S 60 

GGC ACG TGG AGC CCA GAC AGA GGC AGT ATG TAT GGA CTC TCC CCC GGC 24 0 

Gly Thr Trp Ser Pro Asp Arg Gly Ser Met Tyr Gly Leu Ser Pro Giv- 
es 70 75 80 

ACC CAC GAG GGC TCC TGC ACC CAC ACT CAC GAG GGC CCC AAG GAC TCA 28 8 

Thr His Glu Gly Ser Cys Thr His Thr His Glu Gly Pro Lys Asp Ser 
85 90 95 

ATG GCA GGA GAC CAG ACC AGG TCC AGG AAG AGC AAA AAG AAG AAT TAT 33 6 

Met Ala Gly Asp Gin Thr Arg Ser Arg Lys Ser Lys Lys Lys Asn Tyr 
100 10b 110 

CAT CGA TAT AAC AAG CCC CCC TAT TCC TAC CTC GCT ATG ATT GCC CTG 384 
His Arg Tyr Asn Lys Pro Pro Tyr Ser Tyr Leu Ala Met lie Ala Leu 
HE 120 125 

GTC ATC CAG AAC TCG CCC GAG AAG AGG CTC AAA CTC TCC CAG ATC CTG 43 2 

Val He Gin Asn Ser Pro Glu Lys Arg Leu Lys Leu Ser Gin He Leu 
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130 135 140 

AAG GAG GTC AGT ACA CTC TTC CCC TTC TTT AAT GGG GAT TAT ATG GGT 4 80 

Lys Glu Val Ser Thr Leu Phe Pro Phe Phe Asn Gly Asp Tyr Met Gly 
145 150 155 160 

TGG AAA GAC TCC ATC AGG CAC AAC TTG TCT TCC AGT GAC TGC TTT AAG 52 3 

Trp Lys Asp Ser lie Arg His Asn Leu Ser Ser Ser Asp Cys Phe Lys 
165 170 175 

AAG ATT CTC AAA GAC CCT GGA AAG CCC CAG GCC AAG GGT AAC TTC TGG 576 
Lys lie Leu Lys Asp Pro Gly Lys Pro Gin Ala Lys Gly Asn Phe Trp 
180 185 190 

ACG GTG GAT GTT AGC CGG ATT CCT CTG GAT GCG ATG AAG CTG CAG AAC 624 
Thr Val Asp Val Ser Arg lie Pro Leu Asp Ala Met Lys Leu Gin Asn 
195 200 205 

ACT GCG TTG ACC CGA GGT GGA TCA GAC TAC TTT GTC CAG GAT TTG GCT 672 
Thr Ala Leu Thr Arg Gly Gly Ser Asp Tyr Phe Val Gin Asp Leu Ala 
210 215 220 

CCA TAC ATC CTA CAT AAC TAT AAA TAT GAG CAC AAT GCA GGG GCG TAT 72 0 

Pro Tyr He Leu His Asn Tyr Lys Tyr Glu His Asn Ala Gly Ala Tyr 
225 230 235 240 

GGT CAC CAG ATG CCT CCA AGT CAT GCC AGA TCC CTG TCT TTG GCA GAG 76 8 

Gly His Gin Met Pro Pro Ser His Ala Arg Ser Leu Ser Leu Ala Glu 
245 250 255 

GAC TCT CAA CAG ACC AAC ACT GGT GGC AAA CTT AAC ACA TCC TTT ATG 816 
Asp Ser Gin Gin Thr Asn Thr Gly Gly Lys Leu Asn Thr Ser Phe Met 
260 265 270 

ATT GAT TCC CTA CTC CAT GAC CTG CAA GAG GTG GAT CTG CCT GAT GCC 86 4 

He Asp Ser Leu Leu His Asp Leu Gin Glu Val Asp Leu Pro Asp Ala 
275 280 285 

TCC AGG AAC CTT GAG AAC CAA AGG ATC TCT CCG GCT GTA GCC ATG AAC 912 
Ser Arg Asn Leu Glu Asn Gin Arg He Ser Pro Ala Val Ala Met Asn 
290 295 3 DC 

AAT ATG TGG AGC TCT GCT CCT CTT CTC TAC ACT CAT TCC AAG CCA ACA 96 0 

Asn Met Trp Ser Ser Ala Pre Leu Leu Tyr Thr His Ser Lys Pro Thr 
305 310 315 320 

AGG AAT GCC AGA AGC CCT GGT TTG TCC ACC ATC CAT TCC ACG TAC TCC 100 8 

Arg Asn Ala Arg Ser Pro Gly Leu Ser Thr He His Ser Thr Tyr Ser 
32 5 33 0 33 5 

TCT TCC AGC TCC AGC ATT TCT ACA ATC TCC CCC GTT GGG TTT CAG AAG 105 6 
Ser Ser Ser Ser Ser He Ser Thr lie Ser Pro Val Gly Phe Gin Lys 
340 345 350 

GAG CAG GAG AAA AGT GGT CGA CAA ACT CAA AGG GTT GGC CAT CCC ATT 1104 
Glu Gin Glu Lys Ser Gly Arg Gin Thr Gin Arg Val Gly His Pro He 
355 360 365 

AAA CGA TCA AGA GAG GAC GAT GAC TGC AGT ACC ACA TCT TCA GAT CCT 1152 
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Lys Arg Ser Arg Glu Asp Asp Asp Cys Ser Thr Thr Ser Ser Asp Pro 
370 375 380 

GAC ACT GGG AAC TAC TCT CCC ATT GAG CCC CCA AAG AAG ATG CCC TTG 1200 
Asp Thr Gly Asn Tyr Ser Pro He Glu Pro Pro Lys Lys Met Pro Leu 
385 390 395 400 

CTT TCA TTG GAC TTG CCC ACT TCT TAC ACA AAG AGT GTG GCA CCT AAT 124 8 
Leu Ser Leu Asp Leu Pro Thr Ser Tyr Thr Lys Ser Val Ala Pro Asn 
405 410 415 

GTA GTG GCA CCA CCA AGT GTC CTG CCC TTC TTT CAT TTT CCT CGC TTC 12 96 
Val Val Ala Pro Pro Ser Val Leu Pro Phe Phe His Phe Pro Arg Phe 
420 425 430 

ACC TAC TAT AAT TAT GGA CCT TCA CCC TAC ATG ACC CCA CCA TAC TGG 1344 
Thr Tyr Tyr Asn Tyr Gly Pro Ser Pro Tyr Met Thr Pro Pro Tyr Trp 
435 440 445 

GGT TTT CCA CAT CCT ACA AAT TCT GGT GGG GAT AGT CCA CGT GGA CCC 1392 
Gly Phe Pro His Pro Thr Asn Ser Gly Gly Asp Ser Pro Arg Gly Pro 
450 455 460 

CAA TCT CCT CTG GAC CTA GAC AAC ATG TTA CGG GCC ATG CCA CCC AAC 14 4 0 
Gin Ser Pro Leu Asp Leu Asp Asn Met Leu Arg Ala Met Pro Pro Asn 
465 470 475 480 

AAG AGT GTG TTT GAT GTG TTG ACA AGT CAC CCA GGT GAC CTC GTC CAT 14 8 8 
Lys Ser Val Phe Asp Val Leu Thr Ser His Pro Gly Asp Leu Val His 
485 490 495 

CCG TCC TTC CTC AGT CAA TGC TTG GGC AGC AGT GGT TCC CCG TAC CCA 15 3 6 
Pro Ser Phe Leu Ser Gin Cys Leu Gly Ser Ser Gly Ser Pro Tyr Pro 
500 505 510 

AGC AGA CAA GGC CTT ATG TAGAGACGGA GGCCTCCTGG CCTGACCTGG AGTGGACA 1592 
Ser Arg Gin Gly Leu Met 
515 

CTCAATGAAA TGA 16 05 

(2) INFORMATION FOR SEQ ID NO : 11 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 534 amino acids 
(R) TYPE: amine acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY: linear 

(ii) MOLECULE TYFE : protein 
(ix) FEATURE : 

(A) NAME/ KEY : Other 

(B) LOCATION: 1 . . . 1 

(D) OTHER INFORMATION: Xenopus FAST-1 



Ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
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Met Arg Asp Pro Ser Ser Leu Tyr Ser Gly Phe Pro Ala Gly Ser Gin 

15 10 15 

Tyr Glu Ser Val Glu Pro Pro Ser Leu Ala Leu Leu Ser Ser He Asp 

20 25 30 

Gin Glu Gin Leu Pro Val Ala Thr Gly Gin Ser Tyr Asn His Ser Val 

35 40 J 45 

Gin Pro Trp Pro Gin Pro Trp Pro Pro Leu Ser Leu Tyr Arg Glu Gly 

50 55 60 

Gly Thr Trp Ser Pro Asp Arg Gly Ser Met Tyr Gly Leu Ser Pro Gly 
65 70 75 90 

Thr His Glu Gly Ser Cys Thr His Thr His Glu Gly Pro Lys Asp Ser 

85 90 95 

Mec Ala Gly Asp Gin Thr Arg Ser Arg Lys Ser Lys Lys Lys Asn Tyr 

100 105 lio 

His Arg Tyr Asn Lys Pro Pro Tyr Ser Tyr Leu Ala Met He Ala Leu 

115 120 125 

Val He Gin Asn Ser Pre Glu Lys Arg Leu Lys Leu Ser Gin He Leu 

130 135 140 

Lys Glu Val Ser Thr Leu Phe Pro Phe Phe Asn Gly Asp Tyr Met Gly 
I* 5 150 155 ^ 160 

Trp Lys Asp Ser He Arg His Asn Leu Ser Ser Ser Asp Cys Phe Lys 

165 170 175 

Lys He Leu Lys Asp Pro Gly Lys Pro Gin Ala Lys Gly Asn Phe Trp 

180 185 190 

Thr Val Asp Val Ser Arg He Pro Leu Asp Ala Met Lys Leu Gin Asn 

195 200 205 

Thr Ala Leu Thr Arg Gly Gly Ser Asp Tyr Phe Val Gin Asp Leu Ala 

210 215 220 

Pro Tyr He Leu His Asn Tyr Lys Tyr Glu His Asn Ala Gly Ala Tyr 
225 230 235 240 

Gly His Gin Met Pro Pro Ser His Ala Arg Ser Leu Ser Leu Ala Glu 

2^5 250 255 

Asp Ser Gin Gin Thr Asn Thr Gly Gly Lys Leu Asn Thr Ser Phe Met 

260 265 270 

He Asp Ser Leu Leu His Asp Leu Gin Glu Val Asp Leu Pro Aso Ala 

2^5 280 285 

Ser Arg Asn Leu Glu Asn Gin Arg He Ser Pro Ala Val Ala Met Asn 

290 295 300 

Asn Met Trp Ser Ser Ala Pro Leu Leu Tyr Thr His Ser Lys Pro Thr 
305 310 3 i 5 ^ 320 

Arg Asn Ala Arg Ser Pro Gly Leu Ser Thr He His Ser Thr Tyr Ser 

325 330 335 

Ser Ser Ser Ser Ser He Ser Thr He Ser Pro Val Gly Fhe Gin Lys 

34n 345 350 

Glu Gin Glu Lys Ser Gly Arg Gin Thr Gin Arg Val Gly His Pro He 

355 360 365 

Lys Arg Ser Arg Glu Asp Asp Asp Cys Ser Tnr Tnr Ser Ser Asp Pro 

370 375 33C 

Asp Thr Gly Asn Tyr Ser Pro lie Glu Pro Pro Lys Lys Met Pro Leu 
3e5 393. 395 400 

Leu Ser Leu Asp Leu Pro Thr Ser Tyr Thr Lys Ser Val Ala Pro Asn 

405 410 415 

Val Val Ala Pro Pro Ser Val Leu Pro Fhe Phe His Phe Pro Arg Phe 

420 425 430 

Thr Tyr Tyr Asn Tyr Gly Pro Ser Pro Tyr Met Thr Pro Pro Tyr Trp 

435 440 445 

Gly Phe Pro His Pro Thr Asn Ser Gly Gly Asp Ser Pro Arg Gly Pre 

450 455 " 460 

Gin Ser Pro Leu Asp Leu Asp Asn Met Leu Arg Ala Met Pro Pro Asn 
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465 470 
Lys Ser Val Phe Asp Val Leu Thr 
4B5 

Pro Ser Phe Leu Ser Gin Cys Leu 
500 

Ser Arg Gin Gly Leu Met Tyr Arg 
515 520 
Ser Gly His Ser Met Lys 
530 



475 480 
Ser His Pro Gly Asp Leu Val His 

490 495 
Gly Ser Ser Gly Ser Pro Tyr Pro 
505 510 
Arg Arg Pro Pro Gly Leu Thr Trp 
525 



(2) INFORMATION FOR SEQ ID NO: 12: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 155 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ix) FEATURE: 



(A) NAME /KEY : Other 

(B) LOCATION: 1. . .1 

(D) OTHER INFORMATION: Xenopus FAST-1 SID 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



Thr Ser Ser Asp 

X 

Lys Lys Met Pro 
20 

Ser Val Ala Pro 
35 

His Phe Pro Arg 
50 

Thr Pro Pro Tyr 
65 

Ser Pro Arg Gly 

Ala Met Pro Pro 
1C0 

Gly Asp Leu Val 
HE 

Gly Ser Pro Tyr 
130 

Pro Gly Leu Thr 
14 5 



Pro Asp Thr Gly 
5 

Leu Leu Ser Leu 

Asa Val Val Ala 
40 

Phe Thr Tyr Tyr 
55 

Trp Gly Phe Pro 
70 

Pro Gin Ser Pro 
85 

Asn Lys Ser Val 

His Pr:; Ser Phe 
120 

Pro Ser Arg Gin 
135 

Trp Ser Gly His 
ISO 



Asn Tyr Ser Pro 
10 

Asp Leu Pro Thr 

25 

Fro Pro Ser Val 

Asn Tyr Gly Pro 
60 

His Pro Thr Asn 
75 

Leu Asp Leu Asp 
90 

Phe Asp Val Leu 
105 

Leu Ser Gin Cys 

Gly Leu Met Tyr 
140 

Ser Met: Lys 
155 



He Glu Pro Pro 

15 

Ser Tyr Thr Lys 
30 

Leu Pro Phe Phe 
45 

Ser Pro Tyr Met 

Ser Gly Gly Asp 
80 

Asn Me:: Leu Arg 
95 

Thr Ser His Pro 
110 

Leu Gly Ser Ser 
125 

Arg Arg Arg Pro 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i; SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 16 34 oase pairs 
{3> TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 
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( A) NAME/KEY: Other 
(3) LOCATION: 1. . .1 

(D) 0TH2R INFORMATION: Human FAST-1 coding region 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

ATGGGGCCCT GCAGCGGCTC CCGCCTGGGG CCCCCAGAGG CTGAGTCGCC CTCCCAGCCC 6 0 

CCTAAGAGGA GGAAGAAGAG GTACCTGCGA CATGACAAGC CCCCCTACAC CTACTTGGCC 12 0 

ATGATCGCCT TGGTGATTCA GGCCGCTCCC TCCCGCAGAC TGAAGCTGGC CCAGATCATC 16 0 

CGTCAGGTCC AGGCCGTGTT CCCCTTCTTC AGGGAAGACT ACGAGGGCTG GAAAGACTCC 24 0 

ATTCGCCACA ACCTTTCCTC CAACCGATGC TTCCGCAAGG TGCCCAAGGA CCCTGCAAAG 30 0 

CCCCAGGCCA AGGGCAACTT CTGGGCGGTC GACGTGAGCC TGATCCCAGC TGAGGCGCTC 360 

CGGCTGCAGA ACACCGCCCT GTGCCGGCGC TGGCAGAACG GAGGTGCGCG TGGAGCCTTC 420 

GCCAAGGACC TGGGCCCCTA CGTGCTGCAC GGCCGGCCAT ACCGGCCGCC CAGTCCCCCG 480 

CCACCACCCA GTGAGGGCTT CAGCATCAAG TCCCTGCTAA GAAGGTCCGG GGAAGGGGCA 54 0 

CCCTGGCCGG GGCTAGCTCC ACAGAACAGC CCAGTTCCTG CAGGCACAGG GAACAATGGG 60 0 

GAAGAAGCGG TGCCCACCCC ACCCCTTCCC TCTTCTGAAA GGCCTCTGTG GCCCCTCTGC 660 

CCCCTTCCTG GCCCCACGAG AGTGGAGGGG GAGACTGTGC AGGGGGGAGC CATGGGCCCT 720 

CAACCCTCTC CCCAGAGCCT AGGGCCTGGC CTTTCCACTA CTGCAGGGCA CCGCAGTTCT 7 80 

GGGGGACGGT CCAGCGGGGG ACACAGGGCC TCCCTTTGGG GGCAGCTGCC CACCTCCTAC 84 0 

TTGCCTATCT ACACTCCCAA TGTGGTAATG CCCTTGGCAC CACCACCCAC CTCCTGTCCC 900 

CAGTGTCCGT CAACCAGCCC TGCCTACTGG GGGGTGGCCC CTGAAACCCG AGGGCCCCCA 960 

GGGCTGCTCT GCGATCTAAA CGCCCTCTTC CAAGGGGTGC CACCCAACAA AAGCATCTAC 1020 

GACGTTTGGG TCAGCCACCC TCGGGACCTG GCGGCCCCTG GCCCAGGCTG GCTGCTCTCC 1080 

TGGTGCAGCC TGTGAGGCTC TTAAGACAGG GGCCGCTCCT CCCTCCCGCT CCCACCCCCA 1140 

CCTTGTTGAC AGGGAGCCAA GGCGAGGCGG CTGTCTGCGA CCACAGCAGC CTCGAAACAC 1200 

CAGGCAGCAG CCTTGCTGGG AGTCCACGGT GTTTATTGGG CCACCCCACG CATGGCCGTG 1260 

GCCCAGCTGG GCACAACCCT CACCCTGGTC TGTCATGCCT GTTTTTCCTA CACTCAGCGG 13 20 

CAAAACTGCA GGAGCAGGCT GAACCTGAAT ATCCCTTCCT AATCCCTCTT CTCAGCCCAC 13 80 

TACCCATCCA TCAGTCACCA GCCGTCACCT CCCTCCCGTG CTCCAGCTGG GGGAAGGAAA 144 0 

ACCCATGTGG ATCACCTGAA ATCCTGCCCT CTCTCTCTGT CGGAAAAGAA GTCCACCTTT 15 00 

TTCCGGAAAC CGGTTAGGGA ATTAAAATGC CCTACATGTC CTGGTGGTTG GGGGGGAAAC 15 60 

CACTAAAGGA ATTTGCAACC TTTTTTATCC TCTTTCATTT ATCCCAAGGG GGGGCCCGTC 16 20 

CCATTTCCCC AACC 16 34 

(2) INFORMATION FOR SEQ ID NO:14: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 544 amino acids 
(E) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: protein 
(ix) FEATURE: 

(A) NAME/ KEY: Other 

(B) LOCATION: 1 ... 1 

(D) OTHER INFORMATION: Human FAST-1 



(xi) SEQUENCE DESCRIPTION: SEQ IE NO: 14: 

Met Gly Pro Cys Ser Gly Ser Arg Leu Gly Pro Pro Glu Ala Glu Ser 

15 10 15 

Pro Ser Gin Pro Pro Lys Arg Arg Lys Lys Arc Tyr Leu Arg His Asp 

20 25 30 

Lys Pro Pro Tyr Thr Tyr Leu Ala Met lie Ala Leu Val lie Gin Ala 
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35 






40 










45 








Ala Pro 


Ser 


Arg Arg Leu 


Lys Leu 


Ala 


Gin 


He 


lie 


Arg 


Gin 


Val 


Gin 


50 








55 








60 










Ala Val 


Phe 


Pro 


Phe Phe Arg Glu Asp 


Tyr 


Glu 


Gly Trp 


Lys 


Asp 


Ser 


65 






70 








75 










80 


lie Arg 


His Asn 


Leu Ser 


Ser Asn Arg 


Cys 


Phe 


Arg 


Lys 


Val 


pro Lys 








85 






90 










95 




Asp Pro Ala Lys Pro Gin Ala Lys Gly 


Asn 


Phe 


Trp Ala 


Val Asp Val. 






100 






105 










110 






Ser Leu 


He 
115 


Pro 


Ala Glu 


Ala Leu 
120 


Arg 


Leu 


Gin 


Asn 


Thr 
125 


Ala 


Leu 


Cys 


Arg Arg Trp Gin Asn Gly Gly Ala Arg 


Gly Ala 


Phe 


Ala 


Lys 


Asp 


Leu 


130 








135 








140 










Gly Pro Tyr Val Leu His 


Gly Arg 


Pro 


Tyr Arg 


Pro 


Pro 


Ser 


Pro 


Pro 


145 






150 








155 










160 


Pro Pro 


Pro 


Ser 


Glu Gly 
165 


Phe Ser 


lie 


Lys 
170 


Ser 


Leu 


Leu 


Arg 


Arg 
175 


Ser 


Gly Glu Gly Ala 


Pro Trp 


Pro Gly Leu 


Ala 


Pro 


Gin 


Asn 


Ser 


Pro 


Val 






180 






185 










190 






Pro Ala Gly Thr Gly Asn Asn Gly Glu 


Glu 


Ala 


Val 


Pro 


Thr 


Pro 


Pro 




195 






200 










205 








Leu Pro 


Ser 


Ser Glu Arg 


Pro Leu Trp 


Pro 


Leu 


Cys 


Pro 


Leu 


Pro Gly 


210 








215 








220 










Pro Thr Arg Val 


Glu Gly Glu Thr Val 


Gin Gly 


Gly Ala 


Met 


Gly 


Pro 


225 






230 








235 










240 


Gin Pro 


Ser 


Pro 


Gin Ser 


Leu Gly 


Pro 


Gly Leu 


Ser 


Thr 


Thr Ala Gly 








245 






250 










255 




His Arc 


Ser 


Ser 


Gly Gly Arg Ser 


Ser 


Gly 


Gly 


His 


Arg 


Ala 


Ser 


Leu 






260 






265 










270 






Trp Gly Gin Leu 


Pro Thr 


Ser Tyr 


Leu 


Pre 


lie 


Tyr 


Thr 


Pro 


Asn 


Val 




275 






280 










285 








Val Met 


Pro 


Leu 


Ala Pro 


Pro Pro 


Thr 


Ser 


Cys 


Pro 


Gin 


Cys 


Pro 


Ser 


290 








295 








300 










Thr Ser 


Pro 


Ala 


Tyr Trp 


Gly Val 


Ala 


Pro 


Glu 


Thr 


Arg 


Gly 


Pro 


Pro 


305 






310 








315 










320 


Gly Leu 


Leu 


Cys 


Asp Leu 
325 


Asn Ala 


Leu 


Phe 

330 


Gin 


Gly 


Val 


Pro 


Pro 
335 


Asn 


Lys Ser 


He 


Tyr 
340 


Asp Val 


Trp Val 


Ser 
345 


His 


Pro 


Arg 


Asp 


Leu 
350 


Ala 


Ala 


Pro Gly 


Pro 
355 


Gly 


Trp Leu 


Leu 3er 
360 


Trp 


Cys 


Ser 


Leu 


Glx 
365 


Gly 


Ser 


Glx 


Asp Arg 


Gly 


Arg 


Ser Ser 


Leu Pro 


Leu 


Pro 


Pro 


Pro 


Pro 


Cys 


Glx 


Gin 


370 








375 








380 










Gly Ala 


Lys 


Ala 


Arg Arg 


Leu Ser 


Ala 


Thr 


Thr 


Ala 


Ala 


Ser 


Lys 


His 


385 






390 








395 










400 


Gin Ala 


Ala 


Ala 


Leu Leu 


Gly Val 


His 


Gly Val 


Tyr 


Trp 


Ala 


Thr 


Pro 








405 






410 










415 




Arg Met 


Ala 


Val 


Ala Gin 


Leu Gly Thr 


Thr 


Leu 


Thr 


Leu 


Val 


Cys 


His 






42 0 






425 










430 






Ala Cys 


Phe 

435 


3er 


Tyr Thr 


Gin Arg 
440 


Gin 


Asn 


Cys 


Arg 


Ser 
445 


Arg 


Leu 


Asn 


Leu Asn 


He 


Pro 


Ser Glx 


Ser Leu 


Phe 


Ser 


Ala 


His 


Tyr 


Pro 


Ser 


He 


450 








455 








460 










Ser His 


Gin 


Pro 


Ser Pre 


Pro Ser 


Arg 


Ala 


Pro 


Ala 


Gly 


Gly Arg 


Lys 


465 






470 








475 










480 


Thr His 


Val 


Asp 


His Leu 
485 


Lys Ser 


Cys 


Pro 
490 


Leu 


Ser 


Leu 


Set 


Glu 
495 


Lys 


Lys Ser 


Thr 


Phe 
500 


Phe Arg 


Lys Pro 


Val 
505 


Arg 


Glu 


Leu 


Lys 


Cys 
510 


Pro 


Thr 



WO 98/53830 



PCT/US98/10983 



-69- 

Cys Pro Gly Gly Trp Gly Gly Asn His Glx Arg Asn Leu Gin Pro Phe 

515 520 525 

Leu Ser Ser Phe lie Tyr Pro Lys Gly Gly Pro Val Pro Phe Pro Gin 
530 535 540 

{2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 311 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 
(ix) FEATURE: 

(A) NAME / KEY : Other 

(B) LOCATION: 1 . . .1 

(D) OTHER INFORMATION: Human FAST-1 SID 



<xi) SEQUENCE DESCRIPTION : SEQ ID NO: 15: 

Gin Gly Gly Ala Met Gly Pro Gin Pro Ser Pro Gin Ser Leu Gly Pro 

15 10 15 

Gly Leu Ser Thr Thr Ala Gly His Arg Ser Ser Gly Gly Arg Ser Ser 

20 25 30 

Gly Gly His Arg Ala Ser Leu Trp Gly Gin Leu Pro Thr Ser Tyr Leu ' 

35 40 45 

Pro lie Tyr Thr Pro Asn Val Val Met Pro Leu Ala Pro Pro Pro Thr 

50 55 60 

Ser Cys Pro Gin Cys Pro Ser Thr Ser Pro Ala Tyr Trp Gly Val Ala 
65 70 75 80 

Pro Glu Thr Arg Gly Pro Pro Gly Leu Leu Cys Asp Leu Asn Ala Leu 

85 90 95 

Phe Gin Gly Val Pro Pro Asn Lys Ser lie Tyr Asp Val Trp Val Ser 

100 105 110 

His Pro Arg Asp Leu Ala Ala Pro Gly Pro Gly Trp Leu Leu Ser Trp 

115 120 125 

Cys Ser Leu Glx Gly Ser Glx Asp Arg Gly Arg Ser Ser Leu Pro Leu 

130 135 14C 

Pro Pro Pro Pro Cys Glx Gin Gly Ala Lys Ala Arc Arg Leu Ser Ala 
14S 150 155 160 

Thr Thr Ala Ala Ser Lys His Gin Ala Ala Ala Leu Leu Gly Val His 

165 170 175 

Gly Val Tyr Trp Ala Thr Pro Arg Met Ala Val Ala Gin Leu Gly Thr 

180 165 190 

Thr Leu Thr Leu Val Cys His Ala Cys Phe Ser Tyr Thr Gin Arg Glr. 

195 200 205 

Asn Cys Arg Ser Arg Leu Asn Leu Asn He Pro Ser Glx Ser Leu Phe 

210 215 22C 

Ser Ala His Tyr Pro Ser He Ser His Gin Pro Ser Pro Pro Ser Arc 
225 230 235 24C 

Ala Pro Ala Gly Gly Arg Lys Thr His Val Asp His Leu Lys Ser Cys 

245 250 255 

Pro Leu Ser Leu Ser Glu Lys Lys Ser Thr Phe Phe Arg Lys Pro Val 

260 265 270 

Arg Glu Leu Lys Cys Pro Thr Cys Pro Gly Gly Trp Gly Gly Asn His 
275 280 285 
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Glx Arg Asn Leu Gin Pro Phe Leu Ser Ser Phe He Tyr Pro Lys Gly 

290 295 300 

Gly Pro Val Pro Phe Pro Gin 
305 310 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1668 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/ KEY : Other 

(B) LOCATION: 

(D) OTHER INFORMATION: Mouse FAST-1 coding region 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

ATGGCCTCGG GCTGGGACCT GGCCTCAACT TACACTCCGA CTACCCCGAG CCCCCAGTTA 6 0 

GCCCTGGCTC CGGCCCAGGG CTACCTCCCT TGTATGGGGC CTCGCGACAA CTCTCAGCTG 120 

AGGCCTCCAG AGGCAGAGTC TCTTTCGAAG ACCCCCAAGA GGAGGAAGAA GAGATACCTA 18 0 

CGGCATGACA AGCCCCCCTA CACCTACTTG GCCATGATCG CCTTGGTAAT TCAGGCCGCA 24 0 

CCCTTCCGCA GGCTGAAACT GGCTCAGGTC CAGGCAGTGT TCCCCTTCTT CAGGGACGAC 3 00 

TATGAGGGCT GGAAGGACTC CATCCGCCAC AACCTTTCCT CTAATCGGTG CTTCCATAAG 360 

GTGCCCAAGG ACCCTGCAAA GCCCCAGGCC AAGGGCAACT TCTGGGCGGT GGATGTTAGC 42 0 

CTGATTCCTG CTGAGGCGCT GCGCCTTCAG AACACTGCCC TGTGCCGTCG ATGGCAGAAC 480 

CGGGGCACCC ACAGAGCTTT CGCCAAGGAC CTGAGCCCCT ACGTGCTCCA CGGCCAGCCT 5 4 0 

TATCAGCCAC CCAGTCCCCC ACCACCACCT AGGGAGGGTT TCAGCATCAA GTCCCTCCTA 600 

GGGGACCCTG GGAAAGAATC CACATGGCCC CAGCATCCTG GGCTCCCTGG AC AG AG C ACT 660 

GCAGCTCAGG CAGGCACCTT GTCAAAGGGG GAAGAAGGGA TGGGCACTGG ACCCTCTAGC 72 0 

TCCTCTGAGA CGCCTCTGTG GCCCCTCTGC TCCCTTCCTG GGCCCACAAT CATAGAGGGG 78 0 

GAGAGTTCCC AAGGGGAGGT AATCAGGCCT TCTCCCGTCA CCCCAGATCA AGGCTCCTGG 84 0 

CCCCTCCACT TACTTGAGGA TTCCGCAGAT TCCAGGGGAG TGCCCAGGAG GGGGAGCAGA 90 0 

GCCTCCTTGT GGGGACAGCT ACCCACTTCT TACTTGCCCA TCTATACGCC CAATGTAGTA 96 0 

ATGCCCTTGG CCACACTACC GACCACCTCT TGTCCCCAGT GCCCATCTTC TGCCAGCCCA 1G2 0 

GCTTACTGGA GCGTAGGCAC TGAATCCCAA GGGTCCCAGG ACCTGCTCTG TGATCTAGAC 108 0 

TCCCTCTTCC AGGGAGTACC ACCCAACAAG AGTATCTATG ATGTGTGGGT CAGCCATCCT 114 0 

AGGGACCTGG CAGCTCCTGC CCCAGGCTGG CTCCTTTCCT GGTACAGCAT GTAATATTCT 12 0 0 

AGGGCAGAAA GGGCTGTTCT CTCTTCCACC CATGAATATC ATTTTGATGA ACCAGAGCTA 12 6 0 

GGACGATGTC CCGACGGACA GCTTTAAAAC ACCAGCACAG CCTTGCTGAA AACCCACAGC 13 2 0 

TT7AATTAGG TTACTCCAGA AAGG3TTGTC TCTGCTAGAT AGGGAGGTCT GGCCAATCGT 13 8 0 

GCCAGGAGCG GAGCTCAGCC TGTAGAGTGC CTCCTCTTGA TCCTACCTTT TCAGGCCCTC 14 4 0 

AAGCCATCCA TCTATCCATC CCTCTGTCAC CATGCCTTCC TGGCTCCAGG CTGGGGGGAG 15 0 0 

GGAGAGCCAA AAGTGGGTCT GATCTGAAGT CTTGCCCTCT CTCAAATGCC TGGGTAGAGG 15 6 0 

GTAGCACCTT TCAGGGAAAG GGTTAAGAAA TGAAAGACTG GAACGGACAT AATTTTGGTG 162 0 

TAATGGAAGT AGGGGAGCGA TTAATAGTAA AGGAATTTAC AACATTTT 16 6 8 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3 97 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 
(ix) FEATURE: 

(A) NAME /KEY : Other 

(B) LOCATION: 

(D) OTHER INFORMATION: Mouse FAST-1 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 17 : 



Met 


Ala 


Ser 


Gly 


i rp 


Asp 


T on 


Aia 


Ser 


Thr 


Tyr Thr 


Pro 


Thr 


Thr 


Pro 




















10 










15 




Ser 




ul 11 


Leu 


Ala 


Leu 


ai a 


Pro 


Ala 


Gin 


Gly Tyr Leu 


Pro 


Cys 


Met 








Z U 










25 










30 






Gly 


Pro 


Arg 


Asp 


Asn 


Ser 


um 


Leu 


Arg 


Pro 


Pro 


Glu 


Ala 


Glu 


Ser 


Leu 






"5 £ 
J O 










4 0 










45 








S er 


Ly s 


i nr 


Pro 


Lys 


Arg 


Arg 


Lys 


Lys 


Arg 


Tyr 


Leu 


Arg 


His 


Asp Lys 




C A 

jU 










55 










60 










Pre 


Pro 


Tyr 


i nr 


Tyr 


Leu 


Ala 


Met 


lie 


Ala 


Leu 


Val 


He 


Gin 


Ala 


Ala 


6 5 










/ u 










75 










80 


Pro 


Phe 


Arg 


Arg 


Leu 


Lys 


Leu 


Aia 


Gin 


Val 


Gin 


Ala 


Val 


Phe 


Pro 


Phe 




















90 










95 




Phe 


Arg 


sp 


Asp 


Tyr 


m ii 

olU 


uiy 


Trp 


Lys 


Asp 


Ser lie Arg His 


Asn 


Leu 








i a a 
1 u u 










105 










110 






Ser 


Ser 


Asn 


Arg 


Cys 


rne 


HIS 


Lys 


Val 


Pro 


Lys 


Asp 


Pro 


Ala 


Lys 


Pro 






TIC 

lib 










12 0 










125 








Gin 


Ala 




uiy 


Asn 


Phe 


Trp 


Ala 


Val 


Asp 


Val 


Ser 


Leu 


He 


Pro 


Ala 














13 5 










140 










m ii 


Ala 


Leu 


Arg 


Leu 


Gin 


Asn 


Thr 


Ala 


Leu 


Cys 


Arg 


Arg 


Trp 


Gin 


Asn 












150 










155 




160 


Arg 


Gly 


Thr 




Arg 


Ala 


Phe 


Al a 


Lys 


Asp 


Leu 


Ser 


Pro 


Tyr 


Val 


Leu 










165 










170 










175 




His 


Glv 


Gin 


Pro 


Tyr 


Gin 


Pro 


Pro 


Ser 


Pro 


Pro 


Pro 


Pro 


Pro 


Arg 


Glu 








180 










18 5 










190 






Glv 


Phe 


Ser 


lie 


Lys 


Ser 


Leu 


Leu 


oiy 


Asp 


Pro 


Gly 


Lys 


Glu 


Ser 


Thr 






195 










2 0 0 










205 








Trc 


Pro 


Gin 


Hi s 


Pro 


Gly 


Leu 


Pro 


Gly 


Gin 


Ser 


Thr 


Ala 


Ala 


Gin 


Ala 




210 




















220 










Gly 


Thr 


Leu 


Ser 


Lys 


Gly 


Gl u 


Glu 


Gi v 

°-y 


Met 


Gly 


Thr Gly 


Pre 


Ser 


Ser 


225 










230 










235 










240 


Ser 


Ser 


Glu 


Thr 


Pro 


Leu 


Trp 


Pro 


Leu 


Cys 


Ser 


Leu 


Pro Gly 


Pro 


Thr 










245 










250 










255 




He 


He 


Glu 


Gly 


Glu 


Ser 


Ser 


Gin 


Gly 


Glu 


Val 


He 


Arg 


Pro 


Ser 


Pro 








260 










265 










270 






Val 


Thr 


Pro 


Asp 


Gin 


Gly 


Ser 


Trp 


Pio 


Leu 


H.i s 


Leu 


Leu 


Gl u 


Asp 


Ser 






275 










280 










285 








Ala 


Asp 


Ser 


Arg 


Gly Val 


Pro 


Arg 


Arg 


Gly 


Ser 


Arg 


Ala 


Ser 


Leu 


Trp 




290 










2S5 










3C0 








Gly 


Gin 


Leu 


Pro 


Thr 


Ser 


Tyr 


Leu 


Pro 


lie 


Tyr 


Thr 


Pro 


Asn 


Val 


Val 


305 










310 










315 










320 


Met 


Pro 


Leu 


Ala 


Thr 


Leu 


Pro 


Thr 


Thr 


Ser 


Cys 


Pro 


Gin 


Cys 


Pro 


Ser 










325 










330 










335 




Ser 


Ala 


Ser 


Pro 


Ala 


Tyr 


Trp 


Ser 


Val 


Gly 


Thr 


Glu 


Ser 


Gin 


Gly 


Ser 








340 










345 










350 






Gin 


Asp 


Leu 


Leu 


Cys 


Asp 


Leu 


Asp 


Ser 


lieu 


Phe 


Gin Gly Val 


Pro 


Pro 






355 










360 










365 








Asn 


Lys 


Ser 


lie 


Tyr 


Asp 


Val 


Trp 


Val 


Ser 


His 


Pro 


Arg 


Asp 


Leu 


Ala 




370 










375 










380 








Ala 


Pro 


Ala 


Pro 


Gly 


Trp 


Leu 


Leu 


Ser 


Trp 


Tyr 


Ser 


Met 
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385 390 395 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 88 amino acids 
{B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(ix) FEATURE: 

(A) NAME/ KEY : Other 

(B) LOCATION: 

[T» OTHER INFORMATION: Mouse FAST-1 SID 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 18: 



Ser Tyr 


Leu 


Pro 


lie 


Tyr 


Thr 


Pro 


Asn 


Val 


Val 


Met Pro 


Leu 


Ala 


Thr 


1 






5 










10 








15 




Leu Pro 


Thr 


Thr 


Ser 


Cys 


Pro 


Gin 


Cys 


Pro 


Ser 


Ser Ala 


Ser 


Pro 


Ala 






20 










25 








30 






Tyr Trp 


Ser 


Val 


Gly 


Thr 


Glu 


Ser 


Gin Gly 


Ser 


Gin Asp Leu Leu 


Cys 




35 










40 








45 








Asp Leu 


Asp 


Ser 


Leu 


Phe 


Gin 


Gly 


Val 


Pro 


Pro Asn Lys 


Ser 


lie 


Tyr 


50 










55 










60 








Asp Val 


Trp 


Val 


Ser 


His 


Pro 


Arg 


Asp 


Leu 


Ala 


Ala Pro 


Ala 


Pro Gly 


65 








70 










75 








80 


Trp Leu 


Leu 


Ser 


Trp 


Tyr 


Ser 


Men 

















85 
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Claims 

1 . A method for detecting a compound capable of modulating TGF- 
P superfamily signaling, said method comprising the steps of: 

(a) providing a cell having: 

(i) a reporter gene operably linked to a DNA-binding-protein 

5 recognition site; 

(ii) a first fusion gene capable of expressing a first fusion 
protein, said first fusion protein comprising a polypeptide fragment of Smad2 
covalently bonded to a binding moiety, said binding moiety capable of 
specifically binding to said DNA-binding-protein recognition site; and 

1 0 (iii) a second fusion gene capable of expressing a second 

fusion protein, said second fusion protein comprising a polypeptide fragment of 
FAST-1 covalently bonded to a gene activating moiety; 

(b) exposing said cell to said compound; and 

(c) measunng reporter gene expression in said cell, a change in said 
15 reporter gene expression indicating said compound is capable of modulating 

TGF-P superfamily signaling. 

2. A method for detecting a compound capable of modulating TGF- 
(J superfamily signaling, said method comprising the steps of: 

(a) providing a cell having: 
20 (i) a reporter gene operably linked to a DNA-binding-protein 

recognition site; 

(ii) a first fusion gene capable of expressing a first fusion 
protein, said first fusion protein comprising a polypeptide fragment of FAST-1 
covalently bonded to a binding moiety ; said binding moiety capable of 
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specifically binding to said DNA-binding-protein recognition site; 

(iii) a second fusion gene capable of expressing a second 
fusion protein, said second fusion protein comprising a polypeptide fragment of 
Smad2 covalently bonded to a gene activating moiety; 
5 (b) exposing said cell to said compound; and 

(c) measuring reporter gene expression in said cell, a change in said 
reporter gene expression indicating said compound is capable of modulating 
TGF-{} superfamily signaling. 

3. A method for detecting a compound capable of modulating TGF- 
10 P superfamily signaling, said method comprising the steps of: 

(a) providing a cell having: 

(i) a reporter gene operably linked to a DNA-binding-protein 

recognition site; 

(ii) a first fusion gene capable of expressing a first fusion 
15 protein, said first fusion protein comprising a polypeptide fragment of Smad3 

covalently bonded to a binding moiety, said binding moiety capable of 
specifically binding to said DNA-binding-protein recognition site; and 

(iii) a second fusion gene capable of expressing a second 
fusion protein, said second fusion protein comprising a polypeptide fragment of 

20 FAST- 1 covalently bonded to a gene activating moiety: 

(b) exposing said cell to said compound; and 

(c) measuring reporter gene expression in said cell, a change in said 
reporter gene expression indicating said compound is capable of modulating 
TGF-P superfamily signaling. 

25 4. A method for detecting a compound capable of modulating TGF- 
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p superfamily signaling, said method comprising the steps of: 

(a) providing a cell having: 

(i) a reporter gene operably linked to a DNA-binding-protein 

recognition site; 

5 (ii) a first fusion gene capable of expressing a first fusion 

protein, said first fusion protein comprising a polypeptide fragment of FAST- 1 
covalently bonded to a binding moiety, said binding moiety capable of 
specifically binding to said DNA-binding-protein recognition site; 

(iii) a second fusion gene capable of expressing a second 
10 fusion protein, said second fusion protein comprising a polypeptide fragment of 
Smad3 covalently bonded to a gene activating moiety; 

(b) exposing said cell to said compound; and 

(c) measuring reporter gene expression in said cell, a change in said 
reporter gene expression indicating said compound is capable of modulating 

15 TGF-p superfamily signaling. 

5. The method of claim 1, 2, 3 or 4, wherein a decrease in said 
reporter gene expression indicates said compound is capable of inhibiting TGF- 
P superfamily signaling and an increase in said reporter gene expression 
indicates said compound is capable of enhancing TGF-P superfamily signaling. 

20 6. The method of claim 1, 2, 3 or 4. wherein said polypeptide 

fragment of FAST- 1 comprises a FAST-1 SID, wherein said SID consists of, at 
maximum, a polypeptide fragment selected from the group consisting of: 
Xenopus FAST-1 amino acids 380 to 506, human FAST-1 amino acids 234 to 
365, and mouse FAST-1 ammo acids 309 to 398. 
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7. The method of claim 1 or 2, wherein said polypeptide fragment of 
Smad2 consists of, at maximum, Smad2 amino acids 248 to 467, or amino 
acids 274 to 467. 

8. The method of claim 3 or 4, wherein said polypeptide fragment of 
5 Smad3 consists of, at maximum, a polypeptide fragment selected from the 

group consisting of: Smad3 amino acids 230 to 446, amino acids 253 to 446, 
amino acids 230 to 424, or amino acids 253 to 424. 

9. The method of claim 1, 2, 3, or 4, wherein said cell is a yeast cell. 

10. The method of claim 1, 2, 3 or 4, wherein said reporter gene 
10 expression is assayed by a color reaction. 

1 1 . The method of claim 1.2,3, or 4, wherein said reporter gene 
expression is assayed by cell viability. 

12. A cell for detecting a compound capable of modulating TGF-P 
superfamily signaling, said cell having: 

15 (a) a reporter gene operably linked to a DNA-binding-protein 

recognition site; 

(b) a first fusion gene capable of expressing a first fusion protein, 
said first fusion protein comprising a polypeptide fragment of Smad2 
covalently bonded to a binding moiety, said binding moiety capable of 

20 specifically binding to said DNA-binding-protein recognition site; and 

(c) a second fusion gene capable of expressing a second fusion 
protein, said second fusion protein comprising a polypeptide fragment of 
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FAST-1 covalently bonded to a gene activating moiety. 

13. A cell for detecting a compound capable of modulating TGF-P 
superfamily signaling, said cell having: 

(a) a reporter gene operably linked to a DN A-binding-protein 
5 recognition site; 

(b) a first fusion gene capable of expressing a first fusion protein, 
said first fusion protein comprising a polypeptide fragment of FAST- 1 
covalently bonded to a binding moiety, said binding moiety capable of 
specifically binding to said DN A -binding-protein recognition site; and 

10 (c) a second fusion gene capable of expressing a second fusion 

protein, said second fusion protein comprising a polypeptide fragment of 
Smad2 covalently bonded to a gene activating moiety. 

14. A cell for detecting a compound capable of modulating TGF-P 
superfamily signaling, said cell having: 

15 (a) a reporter gene operably linked to a DN A-binding-protein 

recognition site; 

(b) a first fusion gene capable of expressing a first fusion protein, 
said first fusion protein comprising a polypeptide fragment of Smad3 
covalently bonded to a binding moiety, said binding moiety capable of 

20 specifically binding to said DNA-binding-protein recognition site; and 

(c) a second fusion gene capable of expressing a second fusion 
protein, said second fusion protein comprising a polypeptide fragment of 
FAST-1 covalently bonded to a gene activating moiety. 

15. A cell for detecting a compound capable of modulating TGF-p 
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superfamily signaling, said cell having: 

(a) a reporter gene operably linked to a DNA-binding-protein 
recognition site; 

(b) a first fusion gene capable of expressing a first fusion protein, 
5 said first fusion protein comprising a polypeptide fragment of FAST-1 

covalently bonded to a binding moiety, said binding moiety capable of 
specifically binding to said DNA-binding-protein recognition site; and 

(c) a second fusion gene capable of expressing a second fusion 
protein, said second fusion protein comprising a polypeptide fragment of 

10 Smad3 covalently bonded to a gene activating moiety. 



16. The cell of claim 12, 13, 14, or 15, wherein a decrease in said 
reporter gene expression indicates said compound is capable of inhibiting TGF- 
P superfamily signaling and an increase in said reporter gene expression 
indicates said compound is capable of enhancing TGF-P superfamily signaling. 

15 17. The cell of claim 12. 13, 14, or 15, wherein said polypeptide 

fragment of FAST-1 consists of. at maximum, a polypeptide fragment selected 
from the group consisting of: Xenopus FAST-1 amino acids 380 to 506, human 
FAST-1 amino acids 234 to 365, and mouse FAST-1 amino acids 309 to 398. 



18. The cell of claim 12 or 13, wherein said polypeptide fragment of 
20 Smad2 consists, at maximum, of Smad2 amino acids 248 to 467, or amino 
acids 274 to 467. 



19. The cell of claim 14 or 15, wherein said polypeptide fragment of 
Smad3 consists, at maximum, of a polypeptide fragment selected from the 
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group consisting of: Smad3 amino acids 230 to 446, amino acids 253 to 446, 
amino acids 230 to 424, or amino acids 253 to 424. 

20. The cell of claim 12, 13, 14, or 15, wherein said cell is a yeast 

cell. 

5 21. The cell of claim 12, 13, 14, or 1 5, wherein said reporter gene 

expression is assayed by a color reaction. 

22. The cell of claim 12, 13, 14, or 15, wherein said reporter gene 
expression is assayed by cell viability. 

23. A method for detecting a compound capable of modulating 
10 TGF-p superfamily signaling, said method comprising the steps of: 

(a) providing a first polypeptide, said first polypeptide comprising a 
polypeptide fragment of FAST- 1; 

(b) providing a second polypeptide, said second polypeptide 
comprising a polypeptide fragment of Smad2; 

15 (c) exposing said first polypeptide to said second polypeptide and to 

said compound; and 

(d) measuring the level of interaction between said first polypeptide 
and said second polypeptide, an alteration in said level of interaction indicating 
said compound is capable of modulating TGF-P superfamily signaling. 

20 24. A method for detecting a compound capable of modulating 

TGF-P superfamily signaling, said method comprising the steps of: 

(a) providing a first polypeptide, said first polypeptide comprising a 
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polypeptide fragment of Smad2; 

(b) providing a second polypeptide, said second polypeptide 
comprising a polypeptide fragment of FAST- 1; 

(c) exposing said first polypeptide to said second polypeptide and to 
said compound; and 

(d) measuring the level of interaction between said first polypeptide 
and said second polypeptide, an alteration in said level of interaction indicating 
said compound is capable of modulating TGF-P superfamily signaling. 

25. A method for detecting a compound capable of modulating 
TGF-P superfamily signaling, said method comprising the steps of: 

(a) providing a first polypeptide, said first polypeptide comprising a 
polypeptide fragment of FAST- 1; 

(b) providing a second polypeptide, said second polypeptide 
comprising a polypeptide fragment of Smad3; 

(c) exposing said first polypeptide to said second polypeptide and to 
said compound; and 

(d) measuring the level of interaction between said first polypeptide 
and said second polypeptide, an alteration in said level of interaction indicating 
said compound is capable of modulating TGF-P superfamily signaling. 

26. A method for detecting a compound capable of modulating 
TGF-P superfamily signaling, said method comprising the steps of: 

(a) providing a first polypeptide, said first polypeptide comprising a 
polypeptide fragment of Smad3; 

(b) providing a second polypeptide, said second polypeptide 
comprising a polypeptide fragment of FAST- 1; 
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(c) exposing said first polypeptide to said second polypeptide and to 
said compound; and 

(d) measuring the level of interaction between said first polypeptide 
and said second polypeptide, an alteration in said level of interaction indicating 

5 said compound is capable of modulating TGF-P superfamily signaling. 

27. The method of claim 23, 24, 25, or 26, wherein at least one of 
said first polypeptide or said second polypeptide is immobilized on a solid- 
phase substance. 

10 28, The method of claim 23, 24, 25, or 26, wherein a decrease in 

said level of interaction indicates that said compound is capable of inhibiting 
TGF-p superfamily signaling, and wherein an increase in said level of 
interaction indicates that said compound is capable of enhancing TGF-p 
superfamily signaling. 

15 29. The method of claim 23, 24, 25, or 26, wherein said polypeptide 

fragment of FAST- 1 consists of, at maximum, a polypeptide fragment selected 
from the group consisting of: Xenopus FAST-1 amino acids 380 to 506, human 
FAST-1 amino acids 234 to 365, and mouse FAST-1 amino acids 309 to 398. 

30. The method of claim 23 or 24, wherein said polypeptide 

20 fragment of Smad2 consists, at maximum, of Smad2 amino acids 248 to 467 ? or 
amino acids 274 to 467. 

3 1 . The method of claim 25 or 26, wherein said polypeptide 
fragment of Smad3 consists, at maximum, of a polypeptide fragment selected 



WO 98/53830 



PCT/US98/10983 



-82- 

from the group consisting of: Smad3 amino acids 230 to 446, amino acids 253 
to 446, amino acids 230 to 424, or amino acids 253 to 424. 

32. The method of claim 23, 24, 25, or 26, wherein said first 
polypeptide is produced by a cell which contains a first fusion gene capable of 

5 expressing said first polypeptide. 

33. The method of claim 23, 24, 25, or 26, wherein said second 
polypeptide is produced by a cell which contains a second gene capable of 
expressing said second polypeptide. 

10 34. A method for detecting a compound capable of modulating 

TGF-f3 superfamily signaling, said method comprising the steps of: 

(a) providing a reporter gene operably linked to a DNA-binding- 
protein recognition site; 

(b) providing a first fusion protein, said first fusion protein 

15 comprising a polypeptide fragment of FAST- 1 covalently bonded to a binding 
moiety, said binding moiety capable of specifically binding to said DNA- 
binding-protein recognition site; 

(c) providing a second fusion protein, said second fusion protein 
comprising a polypeptide fragment of Smad2 covalently bonded to a gene 

20 activating moiety; 

(d) exposing said first fusion protein to said second fusion protein, 
to said reporter gene, and to said compound; and 

(e) measuring the reporter gene expression, a change in said reporter 
gene expression indicating a compound capable of modulating TGF-P 

25 superfamily signaling. 
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35. A method for detecting a compound capable of modulating 
TGF-P superfamily signaling, said method comprising the steps of: 

(a) providing a reporter gene operably linked to a DNA-binding- 
protein recognition site; 
5 (b) providing a first fusion protein, said first fusion protein 

comprising a polypeptide fragment of Smad2 covalently bonded to a binding 
moiety, said binding moiety capable of specifically binding to said DNA- 
binding-protein recognition site; 

(c) providing a second fusion protein, said second fusion protein 
10 comprising a polypeptide fragment of FAST- 1 covalently bonded to a gene 

activating moiety; 

(d) exposing said first fusion protein to said second fusion protein, 
to said reporter gene, and to said compound; and 

(e) measuring the reporter gene expression, a change in said reporter 
1 5 gene expression indicating a compound capable of modulating TGF-p 

superfamily signaling. 

36. A method for detecting a compound capable of modulating 
TGF-p superfamily signaling, said method comprising the steps of: 

(a) providing a reporter gene operably linked to a DNA-binding- 
20 protein recognition site; 

(b) providing a first fusion protein, said first fusion protein 
composing a polypeptide fragment of FAST- 1 covalently bonded to a binding 
moiety, said binding moiety capable of specifically binding to said DNA- 
binding-protein recognition site; 

25 (c) providing a second fusion protein, said second fusion protein 

comprising a polypeptide fragment of Smad3 covalently bonded to a gene 
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activating moiety; 

(d) exposing said first fusion protein to said second fusion protein, 
to said reporter gene, and to said compound; and 

(e) measuring the reporter gene expression, a change in said reporter 
5 gene expression indicating a compound capable of modulating TGF-0 

superfamily signaling. 

37. A method for detecting a compound capable of modulating 
TGF-P superfamily signaling, said method comprising the steps of: 

(a) providing a reporter gene operably linked to a DNA-binding- 
10 protein recognition site; 

(b) providing a first fusion protein, said first fusion protein 
comprising a polypeptide fragment of Smad3 covalently bonded to a binding 
moiety, said binding moiety capable of specifically binding to said DNA- 
binding-protein recognition site; 

15 (c) providing a second fusion protein, said second fusion protein 

comprising a polypeptide fragment of FAST- 1 covalently bonded to a gene 

activating moiety; 

(d) exposing said first fusion protein to said second fusion protein, 

to said reporter gene, and to said compound; and 
20 (e) measuring the reporter gene expression, a change in said reporter 

gene expression indicating a compound capable of modulating TGF-p 

superfamily signaling. 

38. The method of claim 34, 35, 36, or 37, wherein a decrease in said 
reporter gene expression indicates said compound is capable of inhibiting TGF- 
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P superfamily signaling and an increase in said reporter gene expression 
indicates said compound is capable of enhancing TGF-P superfamily signaling. 

39. The method of claim 34, 35, 36, or 37, wherein said polypeptide 
fragment of FAST- 1 consists of, at maximum, a polypeptide fragment selected 

5 from the group consisting of: Xenopus FAST-1 amino acids 380 to 506, human 
FAST-1 amino acids 234 to 365, and mouse FAST-1 amino acids 309 to 398. 

40. The method of claim 34 or 35, wherein said polypeptide 
fragment of Smad2 consists, at maximum, of Smad2 amino acids 248 to 467, or 
amino acids 274 to 467. 

10 41. The method of claim 36 or 37, wherein said polypeptide 

fragment of Smad3 consists, at maximum, of a polypeptide selected from the 
group consisting of: Smad3 amino acids 230 to 446, amino acids 253 to 446, 
amino acids 230 to 424, or amino acids 253 to 424. 

42, The method of claim 34, 35, 36, or 37, wherein providing said 
15 first fusion protein comprises providing a first fusion gene capable of 

expressing said first fusion protein and wherein providing said second fusion 
protein comprises providing a second fusion gene capable of expressing said 
second fusion protein. 

43. A method for diagnosing a mammal having or likely to develop 
20 a disorder involving abnormal TGF-P superfamily signaling, said method 

comprising determining whether said mammal has a mutation in a gene 
encoding FAST-1. 
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44. The method of claim 43, wherein said mutation is in the Smad 
Interaction Domain (SID). 

45. A method for diagnosing a mammal having or likely to develop 
a disorder involving abnormal TGF-(J superfamily signaling, said method 

5 comprising determining whether said mammal has an altered level of 
expression of FAST-1. 

46. The method of claim 43 or 45, wherein said disorder is a 
developmental disorder. 

47. The method of claim 43 or 45, wherein said mammal is a 

10 human. 

48. The method of claim 47, wherein said human is a fetus. 

49. A substantially pure FAST-1 protein or polypeptide fragment 
thereof, wherein said protein or said polypeptide fragment is from a mammal, 
wherein said protein or polypeptide fragment is for use in modulating TGF-P 

1 5 superfamily signaling. 

50. The protein or polypeptide fragment of claim 49, wherein said 
mammal is a human. 

5 1 . The protein or polypeptide fragment of claim 49, wherein said 
mammal is a rodent. 
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52. The polypeptide fragment of claim 49, wherein said polypeptide 
fragment comprises the Smad Interaction Domain (SID). 

53. A substantially pure polypeptide fragment, wherein said 
polypeptide fragment is a polypeptide fragment of FAST- 1, wherein said 
FAST-1 is from Xenopus, wherein said polypeptide fragment comprises the 
Smad Interaction Domain (SID), wherein said polypeptide fragment is for use 
in modulating TGF-P superfamily signaling. 

54. A substantially pure polypeptide, wherein said polypeptide has 
about 50% or greater amino acid sequence identity to the amino acid sequence 
of a substantially pure mammalian FAST-1 protein, or polypeptide fragment 
thereof, wherein said protein or said polypeptide fragment is for use in 
modulating TGF-P superfamily signaling. 

55. A substantially pure polypeptide, wherein said polypeptide has 
about 75% or greater amino acid sequence identity to the ammo acid sequence 
of a substantially pure mammalian FAST-1 protein, or polypeptide fragment 
thereof, wherein said protein or said polypeptide fragment is for use in 
modulating TGF-p superfamily signaling. 

56. A substantially pure polypeptide, wherein said polypeptide has 
about 90% or greater ammo acid sequence identity to the amino acid sequence 
of a substantially pure mammalian FAST-1 protein, or polypeptide fragment 
thereof, wherein said protein or said polypeptide fragment is for use in 
modulating TGF-P superfamily signaling. 
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57. A substantially pure nucleic acid, wherein said nucleic acid 
encodes a mammalian FAST-1 protein, or polypeptide fragment thereof, 
wherein said protein or said polypeptide fragment is for use in modulating 
TGF-P superfamily signaling. 

5 58. A vector comprising a substantially pure nucleic acid, wherein 

said nucleic acid encodes a mammalian FAST-1 protein, or polypeptide 
fragment thereof, wherein said protein or said polypeptide fragment is for use 
in modulating TGF-P superfamily signaling, and wherein said vector is capable 
of directing expression of said protein or said polypeptide fragment in a cell 
1 0 containing said vector. 

59. A vector comprising a substantially pure nucleic acid, wherein 
said nucleic acid encodes a FAST-1 Smad Interaction Domain (SID), wherein 
said SID is for use in modulating TGF-p superfamily signaling, and wherein 
said vector is capable of directing expression of said SID in a cell containing 

1 5 said vector. 

60. A cell that contains a vector comprising a substantially pure 
nucleic acid, wherein said nucleic acid encodes a mammalian FAST-1 protein, 
or polypeptide fragment thereof wherein said protein or said polypeptide 
fragment is for use in modulating TGF-P superfamily signaling, and wherein 

20 said vector is capable of directing expression of said protein or said polypeptide 
fragment. 

61 . A method of modulating TGF-P superfamily signaling in a cell, 
said method comprising providing a cell with a substantially pure FAST-1 



WO 98/53830 



PCT/US98/10983 



-89- 

protein, or polypeptide fragment thereof, wherein said FAST-1 protein or 
polypeptide fragment is provided intracellular^, and wherein said FAST-1 
protein or polypeptide fragment is sufficient to modulate TGF-0 superfamily 
signaling in a cell. 

5 62. A method of modulating TGF-0 superfamily signaling in a cell, 

said method comprising introducing, into a cell, a vector comprising a 
substantially pure nucleic acid, wherein said nucleic acid encodes a 
substantially pure FAST-1 protein, or polypeptide fragment thereof, wherein 
said vector is capable of directing expression of said protein or said polypeptide 
10 fragment in a cell containing said vector, and wherein expression of said 
FAST-1 protein or polypeptide fragment is sufficient to modulate TGF-P 
superfamily signaling in a cell 

63. The method of claim 61 or 62, wherein said signaling is 

decreased. 



15 



64. The method of claim 61 or 62, wherein said signaling is 

increased. 
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Figure 5B 
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